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(57) Abstract: Methods and compositions for identifying mutations and polymorphisms in mutant genes encoding gene product 
involved in insulin secretion, for example, hepatocyte nuclear factor-loc, glucokinase, amylin and mitochondrial DNA are disclosed. 
Specifically, a microchip comprising a combination of at least two different mutant genes wherein each gene comprises at least one 
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microchip, an isolated nucleic acid, primers and probes which are specifically used to screen or identify the mutations in genes of 
hepatocyte nuclear factor-l°s glucokinase, amylin and mitochondrial DNA are also disclosed. 
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METHODS AND COMPOSITIONS FOR EVALUATING RISK OF DEVELOPING 
TYPE 2 DIABETES IN PEOPLE OF CHINESE DESCENT 

INTRODUCTION 

Field of the Invention 

This subject invention relates to the identification and use of mutations and 
polymorphisms in mutant genes of wild-type genes involved in insulin secretory function 
that are associated with the increased risk of a Chinese individual to develop type 2 
diabetes. The invention is exemplified by a combination of mutations, uniquely identified 
in Chinese individuals with a positive family history of type 2 diabetes, in the genes 
encoding hepatocyte nuclear factor-la, glucokinase, amylin and mitochondrial DNA. The 
combination of mutated genes finds use in screening Chinese individuals at risk of 
developing type 2 diabetes and in providing physicians with information to enable them to 
apply patient tailored therapies. 

Background 

Although people of Chinese ancestry account for >20% of the world's population 
(Chan, et al. (1997) 20:1785), very little is known about the genetic factors that contribute 
to the development of diabetes in this population. The prevalence of diabetes amongst 
Chinese people varies from < 1 % in some rural areas in mainland China to 6-12% in Hong 
Kong, Singapore, and Taiwan (Chan, et al. (1997), supra). Hong Kong can be regarded as 
a paradigm of future China. 

The prevalence of diabetes mellitus is reaching epidemic proportions amongst Hong 
Kong Chinese, with type 2 diabetes being the predominant form in pateints with early- or 
late-onset of disease (Chan and Cockram (1997) Diabetes Care 20:1785). Type 2 diabetes 
mellitus is a heterogeneous disease that is caused by both genetic and environmental 
factors. The age-adjusted prevalence of diabetes in the Chinese population has increased 
from 7.7% in 1990 (Cockram, et al. (1993) Diabetes Res and Clin Practice 21 :67) to 8.9% 
in 1995 (Cockram and Chan (1999) In: Diabetes in the New Millennium, Pot Still Press, 
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Sydney, pp. 11-22). In a population-based study conducted in 1995, the crude prevalence 
of diabetes mellitus was 9.6%, rising from 1.7 % in those aged under 40 years to 25% in 
those older than 60 years (Janus (1997) Clin Exp Pharmacol Physiol 24:987). There is a 
high prevalence of obesity (43%) and positive family history of diabetes (50%) in Chinese 
patients presenting with acute or early onset diabetes (Chan, et al (1993) Postgrad Med J 
69:204; Ko, et al (1998) 35:761). These findings indicate that genetic factors, in addition 
to environmental factors, can be an important cause of early onset diabetes in this 
population. 

Because type 2 diabetes is an insidious disease, it is estimated that as many as half 
of the individuals in Hong Kong that would be considered diabetic remain undiagnosed. 
Most patients are finally diagnosed only when presenting with overt symptoms that often 
are the consequence of advanced disease. Clinic as well as population-based studies reveal 
that about 17% of diabetic patients in Hong Kong are diagnosed before age 35 years (Chan, 
et al (1993) Postgrad Med 69:204; Janus (1996) The Hong Kong cardovascular risk factor 
prevalence study 1995-1996 Dept of Clin Biochem, Queen Mary Hospital of Hong Kong, 
Hong Kong, 1997). Due to their anticipated long duration of disease,it is important to 
classify and characterize the nature of diabetes in these young patients to facilitate early 
diagnosis and appropriate treatments. Current methods of diagnosing type 2 diabetes 
generally involve assessing phenotypic parameters, such as measuring fasting serum 
glucose levels by administering an oral glucose tolerance test (OGTT) to determine 
impaired glucose tolerance (IGT) or impaired fasting glucose (DFG). Phenotypic 
assessments of persons suspected of having type 2 diabetes are important, but they are 
limited in that patients generally receive a diagnosis only after presentation with overt 
symptoms. Furthermore, because the common symptoms of type 2 diabetes are a 
consequence of a combination heterogenous genetic and environmental causes, the therapies 
provided are general with regard to the disease rather than targetted to the specific etiology 
of the individual patient. Numerous studies have attempted to correlate the increased risk 
for development of type 2 diabetes with a mutation of a specific gene, but the results of 
these studies repeatedly demonstrate that no one mutated gene can be attributed as the 
major cause of type 2 diabetes, emphasizing the heterogeneous nature of this disease. 
Furthermore, a mutation in a particular gene that correlates with increased risk for 
developing type 2 diabetes in individuals of one ethnic population is not relevant to 
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individuals of a second ethnic population, wherein the risk for type 2 diabetes in 
individuals of the second ethnic population will correlate with a different mutation or a 
mutation in a completely different gene. 

It is therefore of interest to identify additional genetic mutations and polymorphisms 
that are indicative of an increased risk for developing type 2 diabetes in people of Chinese 
ancestry, and to develop methods that can be effectively employed to prophylactically 
identify asymptomatic Chinese individuals with a genetic predisposition for type 2 diabetes. 

Relevant Literature 

Maturity-onset diabetes of the young (MODY) is a monogenic form of diabetes 
characterized by autosomal dominant inheritance, early onset (usually before 25 years of 
age) and a primary defect in pancreatic p-cell function (Fajans (1990) Diabetes Care 13:49; 
Chan, etal (1990) Diabetic Med 7:211; Byrne, et al. (1996) Diabetes 45:1503). This 
form of diabetes can result from mutations in at least five different genes including those 
encoding the glycolytic enzyme glucolrinase (Froguel, et al (1993) New Engl J Med 
328:697), the liver-enriched transcription factors expressed in the pancreatic p-cell, which 
are hepatocyte nuclear factors HNF-la (Yamagata, et al. (1996) Nature 384:455), HNF-ip 
(Horikawa, etal. (1997) Nature Genet 17:384), and HNF-4a (Yamagata, etal. (1996) 
Nature 384:458), and insulin promoter factor-1 (IPF-1) (Stoffers, et al (1997) Nature 
Genet 17:138). 

Some mutations and polymorphisms in the glucokinase and HNF-la genes that are 
associated with the genetic predisposition of a Chinese individual to develop type 2 diabetes 
mellitus have been initially identified in Ng, et al {Diabetic Medicine 1999, 16:956, herein 
incorporated by reference), but this manuscript does not disclose how these mutations and 
polymorphisms might be used to identify Chinese individuals with increased risk of 
developing type 2 diabetes. 

USPN 5,541,060 discloses the results of screening a cohort of sixteen French 
families having MODY and the identification of several missense mutations in the 
glucokinase gene, however none of the mutations identified are relevant to individuals of 
Chinese descent. USPN 5,800,998 discloses a point mutation at nucleotide 414 of human 
HNF la, but this single point mutation is not associated with a genetic predisposition of a 
Chinese individual to develop type 2 diabetes. 
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Major susceptibility loci for non-insulin dependent diabetes have been identified 
through genome scans of individuals in Mexican-American (Hanis, et al. (1996) Nature 
Genet 13:161) and Finnish (Mahtani, et al. (1996) Nature Genet 14:90) populations, but 
not in individuals of a Chinese population. Specific microsatellite regions of genomic 
DNA can be correlated with major susceptibility loci that closely associate with the 
increased risk of a Chinese subject to develop type 2 diabetes. For instance, Le Stunff, et 
al. {Nature Genet (2000) 26:444) have reported that particular alleles of the insulin gene 
variable number of tandem repeat (VNTR) locus are associated with obesity and type 2 
diabetes. Also, microsatellite polymorphisms flanking the glucokinase have been 
associated with type 2 diabetes in a Taiwanese population (Wu, et al. (1995) Diabetes Res 
Clin Pract 30:21). 

SUMMARY OF THE INVENTION 

Compositions and methods are provided, wherein a unique combination of genetic 
markers indicative of a genetic predisposition for developing type 2 diabetes in members of 
a Chinese population is described. The invention is exemplified by a combination of 
mutated gene sequences from wild-type genes that are involved in insulin secretory 
function, including hepatocyte nuclear factor la (HNF-la), glucokinase, amylin and 
mitochondrial DNA. The combination of representative mutations include G20R, A116V, 
IVS2nt-»GA, R203H, S432C and I618M of HNF-la; V101M, I110T, A119D, Q239R and 
G385V of glucokinase; S20G of amylin; and A3243G of mitochondrial tRNA 1 ^^. The 
combination of the mutated genes of interest will be most efficiently used for screening 
individuals at increased risk by attaching them to a microchip. 

Embodiments of methods for determining or detecting the genetic predisposition of 
a Chinese individual to develop type 2 diabetes include obtaining a sample containing 
genomic nucleic acid from a Chinese patient, such as a tissue biopsy or a blood sample, 
and contacting that sample with a representative combination of at least two mutated genes 
of interest, then subjecting the sample DNA together with the patient's DNA to 
hybridization conditions stringent enough to detect nucleotide differences of at least one 
base pair. Alternatively, particular genes of interest from the genomic DNA of a Chinese 
individual at risk are screened using PCR primer pairs and PCR-RFLP techniques to 
identify the presence or absence of a mutation known to be associated with type 2 diabetes. 
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The methods farther encompass screening the genomic DNA of Chinese individuals who 
have been diagnosed with type 2 diabetes or who have a primary family member with type 
2 diabetes for additional associative mutations in identified genes or for mutations 
correlative with the predisposition of a member of a Chinese population to develop type 2 
diabetes in additional candidate genes, such as those associated with diabetic kidney disease 
and obesity. 

The invention further provides for nucleic acid primers and probes that are 
specifically used to identify mutations, for instance by PCR or hybridization, of wild-type 
genes involved in insulin secretion that are associated with an increased risk of a Chinese 
subject to develop type 2 diabetes. Additionally, proteins translated from genes carrying at 
least one mutation associated with increased risk of a Chinese individual to develop type 2 
diabetes find use in functional diagnostic assays and in the production of diagnostic 
antibodies that bind to the mutant but not the wild-type protein. 

The prophylactic detection of mutations and polymorphisms that are indicative of a 
genetic predisposition of a Chinese individual to develop type 2 diabetes finds application 
in providing clinicians with information that allows for early detection and therapy 
initiation before the onset of overt symptoms or complications, and that enables clinicians 
to administer specifically targetted therapies that address the etiology of an individual's 
disease. 

BRIEF DESCRIPTION OF DRAWINGS 

Figure 1A shows the nucleic acid sequence of human nuclear factor la (HNF-la) 
exon 1 with the G20R mutation (SEQ ID NO:l). The wild-type sequence is GenBank 
number U72612. Figure IB shows the nucleic acid sequence HNF-la exon 2 with the 
A116V mutation (SEQ ID NO:2). The wild-type sequence is GenBank number U72613. 

Figure 2 shows the nucleic acid sequence of HNF-la exons 3 and 4 depicting the 
splice acceptor site mutation IVS2nt-lG-»A (SEQ ID NO:3) and the missense mutation 
R203H (SEQ ID NO:4). The wild-type sequence is GenBank number U72614. 

Figure 3 A shows the nucleic acid sequence of HNF-la exons 5 and 6 with the 
S432C mutation (SEQ ID NO:5). The wild-type sequence is GenBank number U72615. 
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Figure 3B shows the nucleic acid sequence of HNF-loc exon 10 with the I618M mutation 
(SEQ ID NO:6). The wild-type sequence is GenBank number U72618. 

Figure 4A shows the nucleic acid sequence of human glucokinase exon 3, depicting 
the mutations V101M (SEQ ID NO:7), I110T (SEQ ID NO:8) and A119D (SEQ ID 
NO:9). The wild-type sequence is GenBank number AF041016. Figure 4B shows the 
nucleic acid sequence of human glucokinase exon 7 with the Q239R mutation (SEQ ID 
NO: 10). The wild-type sequence is GenBank AF041019. Figure 4C shows the nucleic 
acid sequence of human glucokinase exon 9 with the G385V mutation (SEQ ID NO: 11). 
The wild-type sequence is GenBank number AF04102L 

Figure 5 shows the nucleic acid sequence of the human amylin gene exon 3 with the 
S20G mutation (SEQ ID NO: 12). The wild-type sequence is GenBank number X52819. 

Figure 6 shows the nucleic acid sequence base pairs 3001-3480 of the human 
mitochondrion complete genome, depicting the A3243G mutation (SEQ ID NO: 13). The 
wild-type sequence is GenBank number J01415. 

Figure 7 shows the pedigrees of families with mutations/polymorphisms in the 
glucokinase (HK84) or HNF-la gene (HK10 and HK54). Individuals with diabetes are 
noted by filled symbols; individuals with impaired fasting glucose (IFG) or impaired 
glucose tolerance (IGT) by grey symbols; non-diabetic individuals by open symbols and 
untested by hatched symbols. The arrow indicates the proband. Present age, age at 
diagnosis and genotype of glucokinase or HNF-la of tested individuals are noted: N, 
normal; M, mutation/polymorphism. 

Figure 8 shows the pedigrees of families with an mt3243 mutation. Individuals 
with diabetes are noted by filled symbols, IGT by grey symbols, non-diabetic individuals 
by open symbols, and untested individuals by hatched symbols. The arrow indicates the 
proband. Present age, age of diagnosis, audiogram and genotype are also shown. N, 
normal; M, mutant allele. 

Figure 9A-9J show the pedigrees of 10 families carrying the HNF-la (9A-9B), 
glucokinase (9C-9E), mt3243 (9F-9H) or amylin S20G (9I-9J)gene mutations/ 
polymorphisms. Subjects with diabetes are noted by black symbols, subjects with IFG or 
IGT by grey symbols, non-diabetic and untested subjects by open symbols. The genotype 
of the family members is indicated by: N, wild-type allele; and M, mutant/variant allele. 
Present age, age at diagnosis, therapy and complications are stated in this order. The 
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proband is indicated by an arrow. Abbreviations: Oral, oral drugs; Ins, insulin; R, 
retinopathy; K, albuminuria; U, neuropathy; H, hearing impairment. 

Figure 10 shows the pedigree of a Chinese family with HNF-la IVS2nMG->A 
mutation. Subjects with diabetes are represented by black symbols, subjects with IGT by 
grey symbols and untested ones by open symbols. The genotype of family members is 
indicated: N, normal allele; and M, mutant allele. The proband is indicated by an arrow. 
CP, C-peptide; GST, glucagon stimulation test; Complications: R, retinopathy; K, 
nephropathy; U, neuropathy, 

DESCRIPTION OF THE SPECIFIC EMBODIMENTS 
Compositions and methods are provided, wherein a unique combination of genetic 
markers indicative of a genetic predisposition for developing type 2 diabetes in members of 
a Chinese population is described. The invention comprises as compositions: (1) a 
combination of nucleic acid sequences from wild-type genes that encode proteins important 
for insulin secretory function, each nucleic acid sequence having a mutation uniquely 
associated with the genetic predisposition of a Chinese individual to develop type 2 
diabetes, (2) nucleic acid sequences encoding glucokinase and HNF-la and carrying 
previously unreported mutations indicative of the increased risk of a Chinese person to 
develop type 2 diabetes, (3) a microchip having attached to it at least two of the mutated 
genes of interest, and (4) nucleic acid primers used to detect the unique mutations in the 
genes of interest. The methods involve: (1) obtaining genomic DNA from a Chinese 
subject, (2) combining the genomic DNA with either a combination of the mutated nucleic 
acids of interest or a combination of primers used to identify the presence or absence of a 
mutation in a gene of interest, and (3) detecting for the presence or absence of mutations, 
either by identifying mismatches between the patient's DNA and a wild-type or mutant 
nucleic acid sequence by hybridization techniques, or by amplifying regions of the patients 
DNA that contain putative mutations by PCR, and subjecting the amplicons to restriction 
endonucleases and/or DNA sequencing. 

Advantages of the present invention include that the method of screening uses 
genetic markers shown to cosegregate with type 2 diabetes in persons of Chinese ancestry 
to assess whether a given patient is at increased risk for developing type 2 diabetes. The 
mutations and polymorphisms used for screening are specifically applicable to individuals 
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of Chinese descent. As a further advantage, the screening can be based upon the presence 
or absence of a combination of at least two different mutations or polymorphisms to 
provide for even more accurate and reliable evaluations because the contributing factors to 
development of type 2 diabetes are heterogeneous. Identification of particular mutations or 
polymorphisms in an individual offers the advantage that with this information physicians 
are able to provide more specific and appropriate therapies for individual patients, and to 
guide a patient in making lifestyle adjustments to ameliorate or delay symptoms of diabetes 
and associated complications. Because type 2 diabetes is often an insidious disease, 
representative combinations of genetic markers indicative of a predisposition in a Chinese 
individual to develop the disease can be used to screen populations of individuals who may 
be at increased risk for developing type 2 diabetes so that they can be given appropriate 
therapy before overt diabetic symptoms or complications are realized. Likewise, family 
members of an individual diagnosed with type 2 diabetes can be screened for the particular 
mutants/polymorphisms of the affected individual to quickly identify family members also 
at increased risk of developing type 2 diabetes. 

By a member of a Chinese population is intended to include any individual of 
Chinese ancestry. In certain cases, for instance when a mutation in a gene involved in the 
secretion of insulin is dominant for increasing the risk of a Chinese individual to develop 

type 2 diabetes, a member of a Chinese population will encompass those individuals with at 

j 

least one parent of Chinese descent. A member of a Chinese population may be more 
specifically identified by HLA haplotyping. For example, HLA class I and class II 
frequencies among a Hong Kong Chinese population have been studied by Chang and 
Hawkins (Hum Immunol (1997) 56:125). Numerous studies have been carried out to 
determine HLA class I and class II alleles that are more frequendy or even uniquely found 
in members of a Chinese population, and alleles with strong associations. Shaw et al. and 
Shen et al. have studied HLA polymorphism and allele frequency and association of 
Chinese populations in Taiwan (Tissue Antigens (1997) 50:610; Tissue Antigens (1999) 
53:51; J Formos Med Assoc (1999) 98:11). Allele frequency and associations found in 
Chinese individuals of mainland China have been reported by Trejaut et al. ( Eur J 
Immunogenet (1996) 23:437), Shieh, etaL (Transfusion (1996) 36:818), Zhao et al (Eur J 
Immunogenet (1993) 20:293), Wang, etaL (Tissue Antigens (1993) 41:223; Hum Immunol 
(1992) 33:129), Lee, et al. (Eur J. Immunogenet (1999) 26:275), and Gao etaL (Hum 
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Immunol (1991) 32:269; Tissue Antigens (1991) 38:24; Immunogenetics (1991) 34:401). 
Additionally, a Chinese individual may be objectively defined by U DNA fingerprinting " 
techniques well known to those in the art, where microsatellite, short tandem repeat (STR) 
and variable number tandem repeat (VNTR) loci specific to individuals of Chinese descent 
are identified. Numerous examples of such ethnic genotyping studies have been reported 
(Meng, et al. (1999) J Forensic Sci 44:1273; Yoshimoto, et al. (1999) IntJLegal Med 
1 13: 15; Wu, et al (1999) J Forensic Sci 44: 1039; Evett, et al (1996) Am J Hum Genet 
58:398; Gill and Evett (1995) Genetica 96:69; Balazs (1993) EXS 67:193; Lan, etal 
(1992) ArchKriminol 189:169; and Hwu, etal. (1992) J Formos Med Assoc 91:839). All 
of these above references are incorporated herein by reference. 

Whereas insulin resistance is a strong predictor of type 2 diabetes, it is not 
sufficient for manifestation of the disease (So, et ah (2000) Hong Kong J. Med 6:69-76). 
A relative insulin deficiency is essential to the development of hyperglycemia, setting up a 
vicious cycle wherein elevated glucose levels are toxic to pancreatic P-cells, thereby 
inducing insulin resistance and decreased p-cell secretory function. Based in the intrinsic 
interconnection between insulin secretion and action, the invention is exemplified by a 
combination of mutated gene sequences from wild-type genes that are involved in insulin 
secretory function, including hepatocyte nuclear factor la (HNF-la), glucokinase, amylin 
and mitochondrial DNA. By "genes involved in insulin secretory function" and "genes 
involved in insulin secretion" is intended genes in which a heterozygous mutation has a 
dominant-negative effect on normal pancreatic p-cell secretory function. The invention is 
primarily concerned with a representative array of gene markers, the combination of which 
is uniquely indicative of the genetic predisposition of a member of a Chinese population to 
develop type 2 diabetes (referred to hereinafter as "genes of interest"). The combination of 
representative mutations is exemplified by G20R, A116V, IVS2nt-»GA, R203H, S432C 
and I618M of HNF-la; V101M, I110T, A119D, Q239R and G385V of glucokinase; S20G 
of amylin; and A3243G of mitochondrial tRNA Lcu(UUR) . The mutation IVS2nt->GA 
represents a splice acceptor site mutation that likely results in a truncated translation 
product. 

A representative combination of the mutated genes of interest finds particular use in 
the prophylactic screening of (i) Chinese individuals who have been diagnosed with 
maturity onset diabetes of the young (MODY) to determine the etiology of their disease, 
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(ii) Chinese individuals that have a positive family history of type 2 diabetes to determine 
their likelihood of developing diabetic symptoms, and (iii) and Chinese individuals deemed 
to be at greater risk of developing diabetic symptoms because of correlative phenotypic 
characteristics (i.e. obese individuals). 

The combination of the mutated genes of interest will be most efficiently used for 
screening individuals at increased risk by attaching them to a microchip or other solid 
support. A specific kind of microchip is not critical, except that it must be able to present 
a representative array of at least two different nucleic acid sequences, each with a mutation 
or polymorphism indicative of the increased risk of a Chinese individual to develop type 2 
diabetes. Additionally, it will be useful to attach representative wild-type nucleic acid 
sequences to the chip as comparative controls. The microarray will normally involve a 
plurality of different nucleic acid sequences, usually be at least 10, more usually at least 
20, frequently at least 50, but may have as many as 100 or more. Chips that will find use 
with the present invention are known in the art (for example, see USPN 5,741,644, 
5,837,832 and 6,183,970). Additionally, other solid substrates may be used for the 
covalent attachment of representative combinations of mutated nucleic acid sequences of 
interest, including beads and slides. Solid supports can be made out of glass or silicon 
oxide or other materials that can be adapted to be covalently attached to oligonucleotide 
sequences by the introduction of functionalities which react with oligonucleotides. 

One may use a variety of approaches to bind the nucleic acid to the solid substrate. 
By using chemically reactive solid substrates, one may provide for a chemically reactive 
group to be present on the nucleic acid, which will react with the chemically active solid 
substrate surface. For example, by using silicate esters, halides or other reactive silicon 
species on the surface, the nucleic acid may be modified to react with the silicon moiety. 
One may form silicon esters for covalent bonding of the nucleic acid to the surface. Instead 
of silicon functionalities, one may use organic addition polymers, e.g. styrene, acrylates 
and methacrylates, vinyl ethers and esters, and the like, where functionalities are present 
which can react with a functionality present on the nucleic acid. For example, amino 
groups, activated halides, carboxyl groups, mercaptan grbups, epoxides, and the like, may 
be provided in accordance with conventional ways. The linkages may be amides, amidines, 
amines, esters, ethers, thioethers, dithioethers, and the like. Methods for forming these 
covalent linkages may be found in USPN 5,565,324 and USPN 6,156,501. 
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The invention also contemplates a microassay system and a kit that comprises a 
solid support having attached to it a representative array of nucleic acid sequences, each 
with a mutation or polymorphism associated with the genetic disposition of a Chinese 
individual to develop type 2 diabetes. The microassay system or kit would contain, for 
example, a microchip or beads to which are attached wild-type and mutant nucleic acid 
sequences from genes that encode proteins involved in insulin secretory function* 
preferentially wild-type and mutant sequences from HNF-la, glucokinase, amylin and 
mitochondrial DNA. Additionally, mutant and wild-type sequences known to hybridize 
and known not to hybridize under stringent conditions to those sequences immobilized on 
the support would be included as positive and negative controls, respectively. A 
microassay system or kit with nucleic sequences immobilized on a solid support would 
involve screening by hybridization detection (fluorescent or radioactive signal upon duplex 
formation). Alternatively, another microassay system or kit would include primer pairs 
that anneal to nucleic acid sequences encoding proteins involved in insulin secretion. The 
primer pairs specifically anneal to flanking regions of the genes that putatively contain 
mutations associated with type 2 diabetes, such that PCR amplification with such primers 
would reveal the presence or absence of an associative mutation of interes. Such a kit or 
microassay system would also contain representative mutant and wild-type sequences as 
controls, and screening would be carried out using PCR and sequencing or through PCR- 
restriction fragment length polymorphism analysis (RFLP) and electrophoresis. 

Type 2 diabetes is a heterogeneous disease, and no single mutation or single 
mutated gene can be fully attributed to the manifestation of its symptoms. Therefore, a 
combination of at least two different nucleic acid sequences encoding mutations or 
polymorphisms of closely associated with increased risk of a person of Chinese descent to 
develop type 2 diabetes is attached to a microchip or is individually screened. By "at least 
two different nucleic acid sequences" is intended two different nucleic acid sequences from 
the same wild-type gene having different mutations, or two different mutant nucleic acid 
sequences from two different wild-type genes. Preferably, at least one of the mutant 
sequences A116V of HNF-la (SEQ ID NO:2), V101M (SEQ ID NO:7) or Q239R (SEQ 
D NO: 10) are attached to the microchip or solid support. By wild-type gene is intended 
»ne that is not associated with type 2 diabetes, and this would include any allelic variant of 
le wild-type gene, at any frequency, and that encodes a protein that functions in its 
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expected manner without inducing pathological symptoms. By mutant gene is intended one 
whose sequence has been modified by insertions, deletions, or substitutions of at least one 
nucleic acid base pair, wherein the modification may result in detectable changes in the 
expression or function of the mutant gene product as compared to the wild-type gene 
product. In the genes of interest for the invention, a mutant gene is associated with type 2 
diabetes. The nucleic acid sequences may be from genomic DNA, complementary DNA 
(cDNA) or from messenger RNA (mRNA). They may be synthetic or isolated from human 
bodily tissue or fluid. The mutations preferably occur, but do not need to occur, in a 
translated region of a nucleic acid sequence that encodes a protein that in wild-type form is 
involved in glucose metabolism or insulin secretion. 

Within a translated nucleic acid sequence, a mutation can be a missense mutation, 
replacing one amino acid with another amino acid, or a nonsense mutation, replacing an 
amino acid with a stop codon. Mutations can also be insertions or deletions of at least one 
nucleic acid in either a coding or in non-coding region, such as a region that controls the 
transcription of a gene, including promoters, enhancers, response elements, signal 
sequences and polyadenylation signals, and the like. Single nucleotide polymorphisms 
(SNPs), preferably but not necessarily occurring within the translated regions of nucleic 
acid sequences that encode proteins involved in glucose metabolism or insulin secretion and 
that correlate with increased risk of type 2 diabetes are also contemplated by the present 
invention. Such SNPs can be identified by correlating mutations in known genes that 
cosegregate with development of type 2 diabetes in members of families with a positive 
history of the disease. Additionally, SNPs that occur in non-translated and translated 
regions can be identified through genome-wide scans and correlate linkage analyses of 
family pedigrees. The use of microarray technologies also can be conveniendy applied to 
identifying SNPs of interest. 

Embodiments of methods for determining or detecting the genetic predisposition of 
a Chinese individual to develop type 2 diabetes include obtaining a sample containing 
genomic nucleic acid from a Chinese patient, such as tissue from autopsy or biopsy, or a 
blood sample, and contacting that sample with a representative combination of at least two 
mutated genes of interest, then subjecting the sample DNA together with the patient's DNA 
to hybridization conditions stringent enough to detect nucleotide differences of at least one 
base pair. Alternatively, particular genes of interest from the genomic DNA of a Chinese 
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individual at risk are subjected to restriction fragmentation and then screened using PGR 
primer pairs and PCR-RFLP techniques to identify the presence or absence of a mutation 
known to be associated with type 2 diabetes. The methods further encompass screening the 
genomic DNA of Chinese individuals that have been diagnosed with type 2 diabetes or that 
have a primary family member with type 2 diabetes for additional associative mutations in 
identified genes or for mutations correlative with the predisposition of a member of a 
Chinese population to develop type 2 diabetes in additional candidate genes, such as those 
associated with diabetic kidney disease and obesity. Mutations are most efficiently 
identified in Chinese families with a positive history of developing type 2 diabetes (i.e. 
families with members that develop MODY). However, identified associative mutations 
are useful for identifying the increased risk in any member of a Chinese population. 

In practicing a method of identifying the mutations associated with the genotype of a 
Chinese individual who is at increased risk for developing type 2 diabetes, Chinese subjects 
with (i) a confirmed diagnosis of type 2 diabetes, (ii) a positive familial history of type 2 
diabetes or (iii) phenotypically determined elevated risk factors (e.g. obesity) are identified 
by clinical testing, pedigree analysis, and linkage analysis, using standard diagnostic 
criteria and interview procedures, and DNA or RNA samples are obtained from the 
subjects. 

A sample of genomic DNA is obtained from any nucleated cell source or body 
fluid. Examples of cell sources available in clinical practice include blood cells, buccal 
cells, cervicovaginal cells, epithelial cells from urine, fetal cells, or any cells present in 
tissue obtained by biopsy. Body fluids include blood, urine, cerebrospinal fluid, amniotic 
fluid, and tissue exudates at the site of infection or inflammation. DNA is extracted from 
the cell source or body fluid using any of the numerous methods that are standard in the 
art. It will be understood that the particular method used to extract DNA will depend on 
the nature of the source. 

A variety of techniques are then employed to identify the presence or absence of 
lew or known mutant sequences. First, the sequences of genes known to be involved in 
nsulin secretory function may be subjected to direct DNA sequencing, using methods that 
re standard in the art. Mutations may be detected using a PCR-RFLP, in which pairs of 
lignucleotides are used to prime amplification reactions and the sizes of the amplification 
roducts, cleaved or uncleaved by restriction endonuc leases, are compared with those of 
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control products. Other useful techniques include Single-Strand Conformation 
Polymorphism analysis (SSCP), denaturing gradient gel electrophoresis, and two- 
dimensional gel electrophoresis, EMC, and the like. Detection of known mutations, such 
as those exemplified by the invention, may alternatively be detected using nucleic acid 
probes that contain mutations of interest in sufficiently stringent hybridization conditions. 

Appropriate stringency conditions for identifying mutations of at least one base pair 
in a mutant sequence of a gene involved in insulin secretory function, for example, in 6X 
sodium chloride/sodium citrate (SSC) at at least 42°C, preferably at about 43, 44 or 45 °C, 
followed by a wash of 2X SSC at 50°C, are known to those skilled in the art or can be 
found in Current Protocols in Molecular Biology, John Wiley & Sons, N.Y. (1989). To 
optimize conditions, both salt and temperature may be varied, or either the temperature or 
salt concentration may be held constant while the other variable is changed. For example, 
the salt concentration in the wash step can be selected from a low stringency of about 2X 
SSC at 50 °C to a high stringency of about 0.2X SSC at 50°C. The temperature in the 
wash step can be increased from low stringency conditions at room temperature, about 
22 °C to high stringency conditions at about 65 °C. Optimal conditions will additionally 
depend on the length of the nucleic acid probe used, and the scale at which the 
hybridization takes place. High stringency hybridization conditions using nucleotides 
attached to a microchip may require lower temperatures. One can perform a series of 
routine thermal equilibrium experiments to determine optimal hybridization discrimination 
between wild-type and mutant gene sequences of interest, by starting a low stringency 
temperature of about 20 °C and increasing the temperature in successive 5 °C temperature 
intervals. 

The nucleic acid probes of the invention are nucleic acid sequences from the 
mutated genes of interest. They are at least 8, 12, 15 or 20 base pairs in length, but can be 
50, 80 or 100 base pairs in length, and may even be 250 or 500 base pairs in length, and 
include at least one associative mutation but may include multiple mutations, and can be as 
long as the length of the transcribed gene. The length of the probe chosen will be 
optimized based on the better base pair mismatch discrimination of shorter probes and the 
better duplex stability of longer probes {see USPN 6,156,601 and USPN 6,197,506, herein 
incorporated by reference). The length of the probe used should enable discrimination 
between a mutant and wild-type gene with at least one base-pair mutation. 
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For detection of hybridized probes, light detectable means are preferred, although 
other methods of detection may be employed, such as radioactivity, atomic spectrum, and 
the like. For light detectable means,. one may use fluorescence, phosphoresence, 
absorption, chemiluminescence, or the like. The most convenient will be fluorescence, 
which may take many forms. One may use individual fluorescers or pairs of fluoresces, 
particularly where one wishes to have a plurality of emission wavelengths with large Stokes 
shifts. Dlustrative fluorescers which have found use include fluorescein, rhodamine, Texas 
red, cyanine dyes, phycoerythrins, thiazole orange and blue, etc. When using pairs of dyes, 
one may have one dye on one molecule and the other dye on another molecule which binds 
to the first molecule. For example, one may have one dye on the first or bound component 
and the other dye on the second or complexing component. The important factor is that the 
two dyes when the two components are bound are close enough for efficient energy transfer 
(see USPN 5,992,617). 

The identification of the presence or absence of known mutations can also 
conveniently be detected by PCR followed by restriction analysis and/or sequencing using 
techniques well known to those in the art. PCR analysis furthermore offers an efficient 
technique for identifying new mutations in genes already known to contain mutations that 
correlate with the predisposition of a Chinese individual to develop type 2 diabetes, or in 
identifying associative mutations in additional candidate genes. PCR primers should be at 
least 12 base pairs in length, preferably 15-18 base pairs in length, and may be as long as 
25-30 base pairs in length. They can be designed to anneal to the wild-type gene sequence 
in regions that flank a mutation in a gene of interest, such that extension from the primer 
amplifies a region that allows the detection of the presence or absence of a mutation of 
interest. Primers can also be designed such that their extension results in an amplified 
sequence only in the presence of either a wild-type or mutant gene, as desired. This can be 
accomplished by designing a primer with at least one nucleotide at the 3' end that is 
mismatched with the wild-type sequence, but matched to a mutant sequence. The invention 
is exemplified by primer pairs used to screen HNF-loc, glucokinase, amylin and human 
mitochondrial DNA for mutations. Of particular interest are nucleic acid primers that can 
)e used to screen mutations in HNF-la and glucokinase that have not yet been previously 
eported (see for example, SEQ ID Nos: 34-36). Simultaneous sequencing of several 
uicleic acid samples can also be carried out on a microchip (see USPN 6, 197,506) 
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For SSCP, primers axe designed that amplify DNA products of about 250-300 bp in 
length across non-duplicated segments of the gene of interest. For each amplification 
product, one gel system and two running conditions are used. Each amplification product is 
applied to a 10% polyacrylamide gel containing 10% glycerol. Separate aliquots of each 
amplimer are subjected to electrophoresis at 8 W at room temperature for 16 hours and at 
30 W at 4 °G. for 5,4 hours. These conditions were previously shown to identify 98% of 
the known mutations in the CFTR gene (Ravnik-Glavac et al. , (1994) Hum Mol Genet 
3:801). 

As with identification of associative mutations of interest, identification of 
associative SNPs that correlate with the increased risk of a Chinese individual to develop 
type 2 diabetes can be accomplished by nucleic acid sequencing of desired regions of 
genomic or complementary DNA. Screening for SNPs is pursued most efficiently using 
microarray technologies where attached nucleic acid sequences attached to a solid support 
such as a microchip are exposed to hybridization conditions that allow the discrimination 
between two nucleic acid sequences that differ at one nucleotide (see for example, Wang, et 
al. (1998) Science 280:1077; and Hacia, et al (1998) Nature Genet 18:155). Alternatively, 
mass spectrophotometers can be used to identify small mass differences in PCR products 
that have single nucleotide polymorphisms (see Kirpekar, et al. (1998) Nucleic Acids Res 
26:2554). A further means of analyzing genetic information is "dynamic allele specific 
hybridization" (DASH). This technique uses labeled oligonucleotides in a multiwell format 
that will fluoresce when the oligonucleotide exists in a double-stranded form, but not when it 
is in single-stranded form. Adding a single strand of the DNA to be tested allows the strands 
to hybridize. The temperature at which the strands denature will allow identification of the 
base at the SNP. The DASH technique has the advantages of being technically simple, and 
not requiring expensive equipment. Additional techniques that can be used in the screening 
for SNPs associated with the genetic predisposition of a Chinese person to develop type 2 
iiabetes include exonuclease resistance microsequencing, solution-phase or solid phase 
extension of ddNTPs, and oligonucleotide ligation assay (as described in USPN 5,952,174, 
lerein incorporated by reference). 

After the presence of an associative mutation or SNP is detected by any of the 
bove techniques, the specific nucleic acid alteration comprising the mutation is identified 
y direct DNA sequence analysis or restriction analysis or a combination of both. In this 
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manner, previously unidentified mutations in genes that encode proteins involved in insulin 
secretion, or in genes associated with obesity or diabetic kidney disease may be defined. 
For instance, new mutations could be identified with other genes known to closely correlate 
with familial type 2 diabetes in Chinese subjects (e.g., other MODY genes). Examples of 
additional MODY genes include hepatocyte nuclear factor 4a (HNF-4a), hepatocyte 
nuclear factor lp (HNF-lp), and insulin promoter factor 1 (DPF-1). Additional candidate 
genes of particular interest for screening because mutations or polymorphisms of the wild- 
type genes are positively associated with type 2 diabetes and nephropathy in Chinese 
individuals include those that encode angiotensin converting enzyme 
(ACE)/angiotensinogen (AGT) (Tomino, etal. (1999) Nephron 82:139; Hsieh, etal. 
(2000) Nephrol Dial Transplant 15:1008; Thomas, et al. (2001) Diabetes Care 24;356), 
aldose reductase (Ko, et al (1995) Diabetes 44:727; Moczulski, et al. (1999) Diabetologia 
42:94) and plasminogen activator inhibitor-1 (PAI-1) (Wong, et al. (2000) Kidney lnt 
57:632). 

Nucleic acid sequences that encode genes involved with glucose metabolism, insulin 
resistance, obesity and diabetic kidney can also be screened to identify mutations in, for 
example, proteins that influence insulin binding to its receptor, that are involved in the 
insulin signalling pathway, that influence glucose uptake and cell metabolism. Specific 
examples include associative mutations in the a or p chain of the insulin receptor, the 
insulin receptor substrate proteins (ERS-1 and IRS-2), glucose transporter proteins GLUT2 
and GLUT4, and transcription factors HNF-3P and NeuroDl/Beta2 and to correlated any 
dentified mutation an/or polymorphism with indidence of type 2 diabetes. Examples of 
candidate genes where mutations or polymorphisms have been shown to be associated with 
ype 2 diabetes and obesity in other populations include genes that encode the transporter 
JLUT4 (Abel, et al. (2001) Nature 409:729), the beta-3-adrenergic receptor (Oeveren van- 
)ybicz, et al. (2001) Diabetes Obes Metab 3:47), the hormone resistin (Steppan, et al. 
1001) Nature 409:307), the peroxisome proliferator-activated receptor gamma2 
'PARgamma) (Hasstedt, etal. (2001) J Clin Endocrinol Metab 86:536), uncoupling 
•otein-1 (UCP-1) (Heilbronn, et al. (2000) Diabetologia 43:242), Ieptin (Ohshiro, et al. 
000) J Mol Med 78:516), G protein beta 3 subunit and insulin receptor substrate-1 
-osskopf, et al. (2000) 5:484), and the dopamine D2 receptor (Jenkinson, et al. (2000) lnt 
Obes Relat Metab Disord 24: 1233). Additionally, mutations or polymorphisms shown to 
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be closely associated with type 2 diabetes and nephropathy in other populations include 
genes that encode the G protein beta 3 subunit (Beige, et al (2000) Nephrol Dial 
Transplant 15:1384; Zychma, et al. (2000) Am J Nephrol 20:305), 
methylenetetrahydrofolate reductase (MTHFR) (Shpichinetsky, et al. (2000) JNutr 
130:2493, the glucose transporter GLUT1 (Grzeszczak, et al (2001) Kidney Int 59:631), 
and paraoxonase (PON1) (Inoue, et al (2000) Metabolism 49:1400). 

Additionally, proteins translated from genes carrying at least one mutation 
associated with increased risk of a Chinese individual to develop type 2 diabetes are 
contemplated by the invention and find use in functional diagnostic assays and in the 
production of diagnostic antibodies that bind to the mutant but not the wild-type protein. 
The polypeptides may be the translational products of the entire mutant gene, as well as 
peptides of twelve or more amino acids derived therefrom that contain at least one mutation 
of interest. The polypeptide^) may be isolated from human tissues obtained by biopsy or 
autopsy, or may be produced in a heterologous cell by recombinant DNA methods, well 
known to those in the art (as disclosed in Molecular Cloning, A Laboratory Manual (2nd 
Ed., Sambrook, Fritsch and Maniatis, Cold Spring Harbor, 1989), or Current Protocols in 
Molecular Biology (Eds. Aufubel, Brent, Kingston, More, Feidman, Smith and Stuhl, 
Greene Publ. Assoc., Wiley-Interscience, NY, N.Y., 1992), both references herein 
incorporated by reference). Peptides comprising HNF-la-, glucokinase- or amylin- 
specific sequences may be derived from isolated larger polypeptides described above, using 
proteolytic cleavages by e.g. proteases such as trypsin and chemical treatments such as 
cyanogen bromide that are well-known in the art. Alternatively, peptides up to 60 residues 
in length can be routinely synthesized in milligram quantities using commercially available 
peptide synthesizers. 

Recombinant translational products are expressed from vectors comprising mutant 
nucleic acid sequences of wild-type nucleic acid sequences that encode proteins involved in 
insulin secretion. Exemplified mutant nucleic acid sequences of interest include those that 
encode HNF-la, glucokinase or amylin with single amino acid residue changes, as 
depicted in SEQ ID Nos:l-13, and particularly SEQ ID NO:2, SEQ ID NO:7 and SEQ ID 
NO: 10. A large number of vectors, including plasmid and fungal vectors, have been 
described for expression in a variety of eukaryotic and prokaryotic hosts, and may be used 
for gene therapy as well as for simple protein expression. Vectors used for expression will 
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also include a promoter operably linked to the mutant polypeptide encoding portion, that is " 
preferably the cDNA sequence of the mutated gene of interest or a part thereof that encodes 
a polypeptide of at least 12 amino acids. The encoded polypeptide may be expressed by 
using any suitable commercially available vectors, and any suitable host cells, using 
methods disclosed or cited herein or otherwise known to those skilled in the relevant art. 
The particular choice of vector/host is not critical to the operation of the invention. 

Appropriate host cells include bacteria, archebacteria, fungi, especially yeast, and 
plant and animal cells, especially mammalian cells. Of particular interest are E. coli, 
B. Subtilis, Saccharomyces cerevisiae, SF9 cells, C129 cells, 293 cells, Neurospora, and 
CHO cells, COS cells, HeLa cells, and immortalized mammalian myeloid and lymphoid 
cell lines. Preferred replication systems include M13, ColEl, SV40, baculovirus, lambda, 
adenovirus, and the like. A large number of transcription initiation and termination 
regulatory regions have been isolated and shown to be effective in the transcription and 
translation of heterologous proteins in the various hosts. Examples of these regions, 
methods of isolation, manner of manipulation, etc. are known in the art. Under appropriate 
expression conditions, host cells can be used as a source of recombinantly produced mutant 
polypeptides of interest. 

The translational products of mutant HNF-la, glucokinase or amylin, and/or 
fragments or portions thereof may be used to produce specific antibodies. The antibodies 
may be polyclonal or monoclonal, may be produced in response to the fully translated 
mutant polypeptide or to synthetic peptides as described above. Such antibodies are 
conveniently made using the methods and compositions disclosed in Harlow and Lane, 
Antibodies, A Laboratory Manual, Cold Spring Harbor Laboratory, 1988, other references 
cited herein, as well as immunological and hybridoma technologies known to those in the 
art. Importantly, the antibodies raised against translation products from nucleic acids 
sequences carrying at least one mutatation associated with type 2 diabetes should 
distinguish between the mutant amino acid sequence and the wild-type amino acid 
sequence. In particular, antibodies should have very little or no cross-reactivity for the 
wild-type sequence. Preferably the anti-mutant protein antibodies should bind with higher 
affinity to the mutant polypeptide than to wild-type polypeptide, with binding to the mutant 
polypeptide at levels 500:1, more preferably 1,000:1, greater than binding the wild-type 
polypeptide. 
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Isolated polypeptides corresponding to the entire length of the mutant polypeptide or 
a peptide of at least 12 amino acids in length containing a mutation of interest may be used 
in accordance with conventional methods to immunize a mammal, (e.g., mouse or higher 
mammal, primate, or chimeric or transgenic animals which produce human 
immunoglobulins) in accordance with conventional procedures. See for example, U.S. Pat. 
Nos. 4,172,124; 4,350,683; 4,361,549; and 4,464,465. Hybridomas may be prepared by 
fusing available myeloma lines, e.g., NS/1, Ag8.6.5.3, etc., with peripheral blood 
lymphocytes, splenocytes or other lymphocytes of the immunized host and the resulting 
immortalized B-lymphocytes (e.g., hybridomas, heteromyelomas, EBV transformed cells, 
etc.) selected, cloned and screened for binding to a mutant polypeptide of a wild-type 
protein involved in insulin secretion or glucose metabolism. Monoclonal antibodies raised 
against a mutant polypeptide sequence of interest may be of any immunoglobulin class such 
as IgA, IgD, IgE, IgG and IgM, preferably IgG or IgM, and may be of any one of the 
subclasses of the classes. Whole antibodies, or fragments thereof which retain binding 
activity, may be employed, such as Fab, F(ab ? ) 2 , or the like. Once the antibodies with 
binding specificity for the mutant polypeptide are available, these antibodies may be used 
for screening. Antibodies that distinguish between normal and mutant forms of HNF-la, 
glucokinase, amylin or other mutant/wild-type pairs of proteins involved in insulin 
secretory function may be used in diagnostic tests employing ELISA, EMIT, CEDIA, 
SLIFA, and the like. 

For an assessment of total risk of developing disease or in designing individualized 
treatments of diagnosed patients, identified mutations and polymorphisms that are 
indicative of a Chinese individual to develop type 2 diabetes are correlated with phenotypic 
)arameters of screened patients and interpreted with consideration of a positive or negative 
amily history of the disease. Genetic studies will be correlated with data from individuals 
ndicating hormone levels (growth hormone, adrenaline, Cortisol, noradrenaline, insulin), 
nthropometry (body-mass index; waist-to-hip ratio), hemodynamics (blood pressure), 
irdio vascular risk factors (HDL, LDL, cholesterol, triglycerides) and autoimmunity (anti- 
utamic acid decarboxylase antibodies). For instance, a patient with a single mutation in 
e glucokinase gene may never develop symptoms, whereas the likelihood of a patient 
ith a mutation in both the glucokinase gene and the HNF-la gene or a mutation in the 
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glucokinase gene and the phenotypic attribute of obesity to develop overt type 2 diabetes is 
relatively higher. A positive family history of the disease would increase the predicted 
predisposition even more. Obtaining a genotypic assessment while a patient shows no 
signs of developing disease, or while showing preliminary signs of disease such as 
impaired glucose tolerance (IGT), can enable a physician to initiate therapy or suggest 
lifestyle changes that prevent the onset or progression of overt symptoms. For example, a 
patient identified as having a mutation in the HNF-la gene and IGT, can be treated with 
diet and/or oral drugs and/or insulin early enough that hyperglycemic toxicity of pancreatic 
p-cells and further insulin secretory dysfunction due to their death is prevented or 
ameliorated. In this way, severe complications associated with progressive type 2 diabetes, 
such as nephropathy, retinopathy and sensorineural loss, can be more commonly averted. 

In addition to allowing a clinician to better tailor traditional therapies for treating 
type 2 diabetes, such as diet, oral drugs and insulin, identification of associative mutations 
can enable a clinician to design tailored therapies, such as introducing a wild-type gene into 
a patient to replace a mutant gene that encodes a malfimctioning protein. For gene therapy 
methods, transfection in vivo is obtained by introducing a therapeutic transcription or 
expression vector into the mammalian host, either as naked DNA, complexed to lipid 
carriers, particularly cationic lipid carriers, or inserted into a viral vector, for example a 
recombinant adenovirus. The introduction into the mammalian host can be by any of 
several routes, including intravenous or intraperitoneal injection, intratrachealiy, 
intrathecally, parenterally, intraarticular^, intranasally, intramuscularly, topically, 
transdermally, application to any mucous membrane surface, corneal installation, etc. Of 
particular interest is the introduction of a therapeutic expression vector into a circulating 
bodily fluid or into a body orifice or cavity, such as the heart. Thus, intravenous 
administration and intrathecal administration are of particular interest since the vector may 
be widely disseminated following such routes of administration, and aerosol administration 
finds use with introduction into a body orifice or cavity. Particular cells and tissues can be 
targeted, depending upon the route of administration and the site of administration. For 
example, a tissue which is closest to the site of injection in the direction of blood flow can 
be transfected in the absence of any specific targeting. An arterial catheter can be used to 
introduce the expression vector into an organ such as the heart or kidney. The eye can be 
accessed directly either by the use of ocular drops or by injecting into the eye. For 
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accessing nerves, this can be by injection into the nerve or injection into the region of the 
cell body. If lipid carriers are used, they can be modified to direct the complexes to 
particular types of cells using site-directing molecules. Thus, antibodies or ligands for 
particular receptors or other cell surface proteins may be employed, with a target cell 
associated with a particular surface protein. An amino terminal mitochondrial targeting 
sequence joined to a nucleic acid can be used to target the nucleic acid to the mitochondria. 
See Taylor et al, Nature Genetics 15:212-215, 1997. 

Any physiologically acceptable medium may be employed for administering the 
DNA, recombinant viral vectors or lipid carriers, such as deionized water, saline, 
phosphate-buffered saline, 5% dextrose in water, and the like as described above for the 
pharmaceutical composition, depending upon the route of administration. Other 
components may be included in the formulation such as buffers, stabilizers, biocides, etc. 
These components have found extensive exemplification in the literature and need not be 
described in particular here. Any diluent or components of diluents that would cause 
aggregation of the complexes should be avoided, including high salt, chelating agents, and 
the like. 

The amount of therapeutic vector used will be an amount sufficient to provide for a 
therapeutic level of expression in a target tissue susceptible to diabetic complications or for 
adequate dissemination to a variety of tissues after entry into the bloodstream and to 
provide for a therapeutic level of expression in susceptible target tissues. A therapeutic 
level of expression is a sufficient amount of expression to prevent, treat, or palliate one or 
more diabetic complication or the symptoms of diabetic complications. In addition, the 
dose of the nucleic acid vector used must be sufficient to produce a desired level of 
transgene expression in the affected tissue or tissues in vivo. Other DNA sequences, such 
as adenovirus VA genes can be included in the administration medium and be co- 
transfected with the gene of interest. The presence of genes coding for the adenovirus VA 
gene product may significantly enhance the translation of mRNA transcribed from the 
expression cassette if this is desired. 

A number of factors can affect the amount of expression in transfected tissue and 
thus can be used to modify the level of expression to fit a particular purpose. Where a 
high level of expression is desired, all factors can be optimized, where less expression is 
desired, one or more parameters can be altered so that the desired level of expression is 
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attained. For example, if high expression would exceed the therapeutic window, then less 
than optimum conditions can be used. 

The level and tissues of expression of the recombinant gene may be determined at 
the mRNA level as described above, and/or at the level of polypeptide or protein. Gene 
product may be quantitated by measuring its biological activity in tissues. For example, 
protein activity can be measured by immunoassay as described above, by biological assay 
such as inhibition of ROS, or by identifying the gene product in transfected cells by 
immunostaining techniques such as probing with an antibody which specifically recognizes 
the gene product or a reporter gene product present in the expression cassette. 

Typically, the therapeutic cassette is not integrated into the patient's genome. If 
necessary, the treatment can be repeated on an ad hoc basis depending upon the results 
achieved. If the treatment is repeated, the patient can be monitored to ensure that there is 
no adverse immune or other response to the treatment. 

The following examples are offered by way of illustration of the present invention, 
not limitation. 

EXPERIMENTAL 

Example 1 

Identification of mutations in glucokinase and hepatocyte nuclear factor la genes in 
Chinese patients with early-onset Type 2 diabetes mellitus/MODY 

This example illustrates mutations identified in the glucokinase, HNF-la and 
HNF-4cc genes in a cohort of Chinese patients. Mutations in the glucokinase and HNF-la 
genes are relatively common in early-onset diabetes and they account for about 3 % and 
5%, respectively, of the present Chinese early-onset diabetic patients, v 

Experimental Design and Methods 
Subjects 

The study group consisted of 92 unrelated patients (age 34 ± 5 years (mean ± SD) ? 
range 18—40 years; 30 males and 62 females) who were diagnosed with Type 2 diabetes 
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before 40 years of age and who had a positive family history (at least one first degree 
relative with Type 2 diabetes). The mean age at diagnosis was 30 ±5 years (range 16—40 
years). Thirteen (14%) of these patients met the minimal criteria of MODY (age at 
diagnosis before 25 years old and presence of diabetes in two consecutive generations). 
These patients were selected from a database containing 1800 cases recruited in the 
Diabetes and Endocrine Centre of the Prince of Wales Hospital. Family members of 
probands with MODY gene mutations, if available, were recruited and underwent a 
75-gram oral glucose tolerance test (OGTT). One hundred healthy Chinese (age 33 ± 10 
years, 40 males and 60 females) without a history of. diabetes were recruited as controls 
amongst hospital staff and students. Informed consent was obtained from each subject for a 
blood sample to be taken for DNA isolation and measurement of clinical parameters. This 
study was approved by the Clinical Research Ethics Committee of The Chinese University 
of Hong Kong. 

Screening of glucokinase, HNF-l a and HNF-4 a genes for mutations 

The minimal promoter region and exons of the glucokinase (p* cell form), HNF-la 
and HNF-4a (HNF-4a2 form) genes were screened for mutations by direct sequencing of 
polymerase chain reaction (PCR) products as described (Froguel, et al. (1993) N Engl J 
Med 328:697; Yamagata, etal. (1996) Nature 384:455; Yamagata, etal. (1996) Nature 
384:458). The occurrence of putative mutations in other family members and controls was 
determined by PCR-restriction fragment length polymorphism (RFLP). An artificial 
restriction site was introduced into either the wild-type or mutant sequence if the nucleotide 
substitution did not lead to gain or loss of a restriction site. Briefly, direct sequencing 
identified 5 mutations in the HNF-la gene (G20R, R203H; S432C, I618M and 
IVS2nt-lG->A) and 3 mutations in the glucokinase gene (I110T, A119D and G385V) that 
are unique to Chinese subjects. They were screened as follows. 

HNF-la G20R was screened by using the forward primer 
5'-GGCAGGCAAACGCAACCCACG-3' (SEQ ID NO:14) and modified reverse primer 
5 , -CAGTGCCTCTTTGCTCAGGC-3 , (SEQ ID NO: 15) for PCR amplification followed 
by digestion with Sail. The wild-type allele showed 19 and 140 bp products whereas the 
mutant allele showed a 159 bp product. 

HNF-la R203H was screened by using the forward primer 
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5 '-TGCCTGC AG AGTTC ACCCATG-3 ' (SEQ ID NO: 16) and modified reverse primer 
5 '- ATCTGCTGGGATGCTGGGCCCCACTTGC AA-3 ' (SEQ ID NO: 17) for PCR 
amplification followed by digestion with BsrDl. The wild-type allele showed a 121 bp 
product whereas the mutant allele showed 26 and 95 bp products. 

HNF-lct S432C was screened by using the forward primer 
5 '-TGG AGC AGTCCCTAGGG AGGC-3 ' (SEQ ID NO:18) and reverse primer 
S'-GTTGCCCCATGAGCCTCCCAC-S' (SEQ ID NO: 19) for PCR amplification followed 
by digestion with CacSl. The wild-type allele showed 104 and 218 bp products whereas the 
mutant allele showed 37, 67 and 218 bp products. 

HNF-loc I618M was screened by using the forward primer 
5'-GTACCCCTAGGGACAGGCAGG-3 > (SEQ ID NO:20) and reverse primer 
5'-ACCCCCCAAGCAGGCAGTACA-3' (SEQ ID NO:21) for PCR amplification followed 
by digestion with Taql. The wild-type allele showed 88 and 160 bp products whereas the 
mutant allele showed a 248 bp product. 

HNF-lcc IVS2nt-lG->A was screened by using the forward primer 
5'-GGGCAAGGTCAGGGGAATGGA-3' (SEQ ID NO:22) and reverse primer 
5'-CAGCCCAGACCAAACCAGCAC-3' (SEQ ID NO:23) for PCR amplification followed 
by digestion with ft/I. The wild-type allele showed 73 and 231 bp products whereas the 
mutant allele showed a 304 bp product. 

Glucokinase MOT was screened by using the forward primer 
5 '-GTCCCTGAGGCTG AC AC ACTT -3 ' (SEQ ID NO:24) and reverse primer 
5 ' - AGCTGGGCCCTGAG ATCCTGCA-3 ' (SEQ ID NO:25) for PCR amplification 
followed by digestion with Fold. The wild-type allele showed 108 and 142 bp products 
whereas the mutant allele showed a 250 bp product. 

Glucokinase A119D was screened by using the forward primer 
5 '-ACCTGGGTGGCACTAACTTCA-3 ' (SEQ ID NO:26) and modified reverse primer 
5'-CGGCCCCTGCGCTGCTCACCATCTGA-3' (SEQ ID NO:27) for PCR amplification 
followed by digestion with Bell. The wild-type allele showed a 150 bp product whereas the 
mutant allele showed 28 and 122 bp products. 

Glucokinase G385V was screened by using the forward primer 
5'-GGACTGTCGGAGCGACACTCA-3 ' (SEQ ID NO:28) and modified reverse primer 
5'-GCGGTTGATGACGCCTGCCAG-3 ' (SEQ ID NO:29) for PCR amplification followed 
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by digestion with Faul. The wild-type allele showed 5, 22, 44 and 137 bp products 
whereas the mutant allele showed 5, 44 and 159 bp products. 

Mutations in the amylin gene (S20G) and mitochondrial DNA (A3243G) were 
screened as follows. 

Amylin S20G was screened by using the forward primer 
5 '-TCAC ATTTGTTCCATGTTAC -3' (SEQ ID NO:30) and reverse primer 
5 '-C AATAACT ATAGAGTTACATTG-3 * (SEQ ID NO:31) for PCR amplification 
followed by digestion with Msph The wild-type allele showed a 239 bp product whereas 
the mutant allele showed 99 and 140 bp products. 

Mitochondrial DNA A3243G was screened by using the forward primer 
5 '-A AGGTTCGTTTGTTC AACGA-3 1 (SEQ ID NO:32) and reverse primer . 
5 '-AGCGAAGGGTTGTAGTAGCC-S ' (SEQ ID NO:33) for PCR amplification and 
labeling of PCR product with a 32 PdATP at the last cycle. The PCR products were then 
digested mth Apal and analysed on 8% denaturing polyacrylamide gels. The wild-type 
allele showed a 427 bp product whereas the mutant allele showed 213 and 214 bp products. 

Clinical studies 

All patients underwent a structured assessment including documentation of family 
history, age at diagnosis and body mass index (BMI) (Piwernetz, et al (1993) Diabetic 
Med 10:371; Chan, et al (1997) Hong Kong Auth Qual Bull 2:3). Family history was 
documented in two generations only since the diabetic status of grandparents was usually 
unknown. A fasting blood sample was taken for the measurement of glucose, C-peptide and 
glycosylated haemoglobin (HbA lc ). Obesity was defined as a BMI >21 kg/m 2 in men and 
_>25 kg/m 2 in women (National Diabetes Data Group (1979) Diabetes 28:1039). 

Assays 

Plasma glucose concentrations were measured by a glucose oxidase method 
(Diagnostic Chemicals, Charlottetown, Prince Edward Island, Canada). C-peptide was 
measured by radioimmunoassay (Novo-Nordisk, Copenhagen, Denmark). HbA, c was 
measured by gel electrophoresis (Ciba Corning Diagnostics Corp, Palo Alto, CA). 



Data analysis 
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Data are expressed as mean ± SD if normally distributed. Otherwise, data are 
expressed as median and range. 

Results 

Mutations and polymorphisms in the glucokinase, HNF-l a and HNF-4 a genes 

Screening of the promoter region and exons la, 2-10 of me glucokinase gene 
(Stoffel, et al (1992) Proc Natl Acad Sci 89:7698; Tanizawa, et al. (1992) 6: 1070) 
revealed three novel missense mutations: I110T, A119D and G385V. In addition to these 
mutations, three uncommon variants (two of which had not been previously described) and 
two polymorphisms were found in the 5'-untransIared region of the mRNA and intron 
regions (Table 1). The brother and mother of subject HK84 (Table 1) also inherited the 
I110T mutation (Fig. 7). The mother was diagnosed with diabetes at the age of 64 years 
upon screening. The brother aged 25 years, when tested with a 75 g OGTT had a plasma 
glucose at 0 and 120 min of 6.3 mmol/1 and 6.9 mmol/l, respectively. These results were 
inconclusive, suggesting impaired fasting glucose (BFG) by the 1997 ADA criteria but not 
reaching that of impaired glucose tolerance (IGT) by the 1998 WHO criteria. 

Screening of the HNF-la gene revealed four missense mutations (G20R. R203H, 
S432C and I618M) and one splice acceptor site mutation (IVS2nt-lG->A) (Table 2). All of 
these represent mutations in the HNF-la gene unique to Chinese patients. Subject HK10 
(Table 2) had three siblings (ages 26—36 years) with diabetes. The affected siblings all had 
inherited the IVS2nt-lG-»A mutation while another sibling and the father with IGT had 
not. Moreover, the maternal grandparents, uncle and mother of HK10 were diabetic but 
they were not available for screening (Fig. 7) (Chan, et al (1990) Diabetic Med 7:21 1). 
Subject HK54 (Table 2) had four siblings (age 33—43 years) with normal glucose tolerance 
and one sibling (age 39 years) with IGT. The father and mother were diagnosed as having 
diabetes at the ages of 50 and 60 years, respectively. Neither the mother and nor any of the 
siblings had inherited the R203H mutation (Fig. 7). In addition to the putative diabetes- 
associated mutations in HNF-la, two substitutions resulting in common amino acid 
polymorphisms, four silent mutations and nine variants/polymorphisms in introns were 
identified (Table 2). Family members of the other five probands (Tables 1 and 2) with 
glucokinase or HNF-la missense mutations were not available for screening. None of the 
mutations in the glucokinase and HNF-la genes were found in 100 healthy controls. 
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Analysis of the promoter region and exons la, 2—10 of the HNF-4a gene (Furuta, 
et al (1997) Diabetes 46: 1652) revealed no obvious diabetes-associated mutations. Three 
patients were heterozygous for a previously described amino acid polymorphism, T/I130 
(Yamagata et al, (1997) Nature 384:458). Two patients were heterozygous for a silent 
mutation in the codon for L211, and one patient was heterozygous for a silent mutation in 
the codon for P441 (Table 3). There were two polymoiphisms in the intron upstream of 
exon 2 (intron IB) and a G-+A substitution in the promoter was found in the heterozygous 
state in one patient. The G-+A substitution in the promoter at nucleotide -462 was not 
located in a known or-acting regulatory region of the gene (Furuta, et al. (1997) supra) 
and its effect on the regulation of expression of HNF-4cc remains to be determined. 

Clinical features of patients with MODYor unknown etiology 

The clinical features of the patients with mutations in the glucokinase and HNF-la 
genes or with unknown etiology are shown in Table 4. Of the 92 patients, 54 (59%) were 
non-obese at the time of study. The mean age at diagnosis of the patients with glucokinase 
mutation-associated diabetes ('glucokinase diabetes 1 ) was 28 years. All three subjects had 
mild hyperglycemia and satisfactory glycemic control (fasting glucose <7.4 mmol/1; HbA lc 
<6.7%; non-diabetic range: 5.1—6.4%). These patients had varying degrees of basal 
pancreatic P cell secretory function as indicated by their fasting C-peptide levels (0.28— 
1.60 nmol/1) (Chan, etal (1990) supra). All were treated with diet or oral drugs. No 
diabetic complications were observed in the three patients with glucokinase mutation 
(Froguel, etal. (1993) supra; Page, etal. (1995) 12:209; Velho, etal. (1997) 40:217). 

The mean age at diagnosis of the patients with HNF-la mutation-associated diabetes 
was 31 years. Among the four patients (HK30, 54, 90 and 92) with missense mutations, all 
had mild hyperglycemia and satisfactory glycemic control (fasting glucose .<7.4 mmol/1; 
HbA, c <7. 1 %) but exhibited varying degrees of basal pancreatic P cell secretory function 
(fasting C-peptide, 0.10—0.49 nmol/1). They did not have diabetic complications and were 
treated with diet or oral drugs. The subject (HK10) with the splice-site IVS2nt-iG-+A 
mutation was not overweight when diagnosed at the age of 19 years (Fajans (1990) 
Diabetes Care 13:49-64) and presented with proliferative retinopathy and clinical 
proteinuria. She was treated with insulin continuously for three months after the diagnosis. 
She eventually developed neuropathy and renal failure. 
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The mean age at diagnosis of the 84 patients with unknown etiology was 30 years, 
similar to those with glucokiriase or HNF-la gene mutations. Large variations in the 
degree of hyperglycemia (fasting glucose 8.5 ± 3.3 mmol/1) and basal pancreatic (5 cell 
secretory function (fasting C-peptide 0.03—4.96 nmol/1) were observed. Most of these 
pateints were treated with oral drugs or diet (92%). 

Example 2 

Mitochondrial DNA A3243G mutation in patients with 
early- or late-onset Type 2 diabetes mellitus in Hong Kong Chinese 

This example illustrates the prevalence of the mitochondrial DNA A3243G mutation 
in the Hong Kong Chinese population as represented by a large cohort of type 2 diabetic 
patients with differing ages of diagnosis and clinical phenotypes. 

Experimental Design and Methods 
Subjects 

The study group consisted of 906 unrelated type 2 diabetic patients diagnosed 
according to the 1985 WHO criteria (World Health Organization, 1985). This cohort 
included four groups of patients selected according to the age of diagnosis and the presence 
or absence of family history of diabetes. Groups 1 and 2 consisted of 219 and 128 patients, 
respectively, with ah early age of diagnosis (<40 years) and with (Group 1) or without 
(Group 2) a family history of diabetes. Groups 3 and 4 consisted of 211 and 348 patients, 
respectively, with an older age of diagnosis (>40 years) and with (Group 3) or without 
(Group 4) a family history of diabetes. Patients in each of these groups were randomly 
selected from a cohort recruited in the Diabetes and Endocrine Centre of the Prince of 
Wales Hospital, which has a catchment of 1.2 million population in Hong Kong. All the 
patients underwent a structured assessment based on the Europe DiabCare Protocol 
(Piwernetz, et al. (1993) Diabetic Med 10:371; Chan, et al (1997) Hosp Auth Qual Bull 
2:3). Family members of mt3243 mutation carriers, if available, were recruited and 
underwent a 75 gram oral glucose tolerance test (OGTT). Two hundred and thirteen 
healthy Chinese without a history of diabetes were recruited as control subjects amongst 
hospital staff and students. The present study group included 75 early onset type 2 diabetic 



WO 02/072875 PCT/CN02/00158 

34 

(two of whom had an mt3243 mutation) and 95 control subjects who were included in a 
previous report {see Smith, etal. (1997) Diabetic Med 14:1026). Informed consent was 
obtained from each subject for a blood sample to be taken for DNA isolation and 
measurement of clinical partners. This study was approved by the Clinical Research Ethics 
Committee of The Chinese University of Hong Kong. 

Mt3243 mutation analysis 

Leukocyte DNA was extracted by standard methods involving proteinase K and 
phenol/chloroform (Sambrook, et al (1989) Molecular Cloning: A Laboratory Manual, 
Cold Spring Harbor Laboratory Press, New York, incorporated herein by reference). The 
mt3243 genotype was determined by polymerase chain reaction (PCR) amplification and 
Apal digestion as described (Smith, et al., (1997) supra). In brief, the DNA region 
spanning nucleotide 3029 and 3456 was amplified by PCR and labelled with a^PdATP at 
the last cycle. This method prevents the underestimation of the proportion of mutant 
mtDNA as a consequence of heteroduplex formation during the PCR (Schoflher, et al. 
(1990) Cell 61:931). The PCR products were then digested with Apal (Gibco BRL, 
Gaithersburg, MD, USA) for 2h at 30°C. Digested PCR products were electrophoresed on 
8% denaturing polyacrylamide gels and visualized by autoradiography. The presence of 
mt3243 led to the cleavage of the 427 bp product into 213 and 214 bp fragments. Standards 
containing 0-100% mutant mt3243 (made by mixing a cloned DNA carrying no mt3243 
mutation and another cloned DNA carrying > 99% mutant mt3243 in different proportions, 
kindly given by Dr J. van den Ouweland, Leiden University) were also included in the 
assay. The 100% mutant DNA was used as a positive control to evaluate completeness of 
PCR product digestion. The intensity of bands was quantified by a Bio-Rad Model GS-670 
imaging densitometer and a Molecular Analyst software (version 1.3) (Bio-Rad, Hercules, 
CA, USA). The proportion of mt3243 in a sample was calculated by dividing the intensity 
at mutant bands (213 and 214 bp) by the total intensity of both wild-type and mutant bands. 

Clinical studies 

All patients underwent a structured assessment including documentation of family 
history, age of diagnosis, body mass index (BMI) and waist-to-hip ratio. Audiometry was 
performed by a technician at the otolaryngology department to assess the sensorineural 
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status in subjects carrying the mt3243 mutation. A fasting blood sample was taken for the 
measurement of glucose, C-peptide, insulin and glycosylated haemoglobin (HbA lc ). Insulin 
resistance (IR) was estimated using the homeostasis model assessment (HOMA) where 
IR=fasting insulin x fasting glucose/22.5 (Matthews et al, (1985) Diabetologia 28:412). 
Obesity was defined as a BMI J> 27 kg/m 2 in men and >25 kg/m 2 in women (National 
Diabetes Data Group, 1979). The basal pancreatic p-cell reserve was also assessed by 
plasma fasting C-peptide level. Patients with a C-peptide level <0.2nmol/l were 
considered to be insulin deficient (Service et al 9 . (1997) Diabetes Care 20:198). 

Biochemical assays 

Plasma glucose was measured by a glucose oxidase method (Diagnostic Chemicals, 
Charlottetown, PEI, Canada). HbA lc was measured by gel electrophoresis (Ciba Corning 
Diagnostics Corp, Palo Alto, CA. USA). C-peptide was measured by radioimmunoassay 
(Novo-Nordisk, Copenhagen, Denmark) with an intra-assay coefficient of variance (CV) of 
3.4% and interassay CV of 9.6%. Insulin was measured by radioimmunoassay (Pharmacia, 
Uppsala, Sweden) with in intra-assay CV of 6% and interassay CV of 13.8%. 

Statistical analysis 

Data are expressed as mean ± SD or median (range) as appropriate. The X 2 test 
was used for analysing categorical data. Spearman correlation was used for measurement 
of association between variables. All statistics was performed with the Statistical Package 
for Social Sciences (SPSS) for Windows, version 6.1. A P-value <0.05 was considered as 
significant. 

Results 

The clinical details of the 906 type 2 diabetic patients are shown in Table 5. A 
dgnificantly higher prevalence of maternal over paternal history of diabetes was found in 
)oth early- (Group 1) and late-onset (Group 3) diabetic patients with a positive family 
dstory (Table 5). 
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Amongst the 906 type 2 diabetic patients, in addition to the two patients reported 
previously (Smith, et al. (1997) supra), three more patients carrying the mt3243 mutation 
were identified. In Group 1, this mutation was found in 1.8% of (four of 219) early onset 
patients with a positive family history. This prevalence increased to 3% (four of 133) if 
only those with a positive maternal family history were considered. In addition, one of the 
348 late-onset patients without a family history (Group 4) was found to have this mutation 
(0.3%). None of the 128 early onset patients who had no family history (Group 2) or 211 
late-onset patients with a positive family history (Group 3) or 213 control subjects had the 
mutation. 

Amongst the five probands with the int3243 mutation, three families were recruited 
(Fig. 8). In family A, family members with diabetes or IGT were identified but none of 
them carried the mutation. In Family E, two more subjects were found to have the mutation 
of whom one had diabetes. The clinical and biochemical characteristics of subjects carrying 
the mt3243 mutation are summarized in Table 6. 

. The percentage of mt3243 varied from 1 % to 14% (Table 6). There was no 
correlation of heteroplasmy level of mutation with levels of HbA lc , fasting plasma glucose, 
C-peptide, insulin, insulin resistance or the presence of sensorineural impairment (P>0,05) 
(Table 6). 

Families A and B 

These two families have been reported in a previous study (Smith, et al., (1997) 
supra). The 37-year-old proband (II-4) in family A had been treated with oral drugs since 
diagnosed at the age of 32 years. The mother (1-2) and two siblings (II- 1 and II-3) were 
diabetic while the father (1-1) and one sister (II-2) had IGT. However, none of the family 
members had the mt3243 mutation although the mother had a history of hearing loss. 

The proband of family B was diagnosed as having diabetes at the age of 22 years 
and had been treated with oral drugs. The mother was diabetic and deaf but was not 
available for screening. 
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Family C 

The 38-year-old proband (III-3) was diagnosed with diabetes at the age of 31 years. 
She was treated with diet and oral drugs for 6 years before being changed to insulin 
therapy. Audiometry revealed bilateral high tone sensorineural impairment. The father 
(II-l) had normal glucose tolerance and did not have the mt3243 mutation. The older sister 
(in-2) and the mother (II-2) both developed diabetes at about 40 years of age and the 
grandmother had diabetes at the age of 50 years. The mother became deaf at the age of 59 
years. None of these affected members were available for screening. 

Family D 

The 79-year-old proband was diagnosed with diabetes at the age of 70 years. An 
audiometry test revealed high tone sensorineural impairment. Neither of the parents and 
nor any siblings were available for screening or known to have diabetes. 

Family E 

The 35-year-old proband (II-3) was diagnosed with diabetes at the age of 33 years 
and was treated with diet. One of the sisters (II-4) had normal glucose tolerance while two 
sisters (II-l and II-2) were diagnosed to have diabetes at the age of 30 and 38 years, 
respectively. The father (1-1) and mother (1-2) also had diabetes at the age of 50 and 35 
years, respectively. All the diabetic and nondiabetic sisters who came for screening had the 
mt3243 mutation. One of the diabetic sisters (II-l) had high tone sensorineural impairment 
whereas the audiogram of the proband was normal. 

Example 3 

The role of the amylin gene S20G mutation in early onset Type 2 diabetes 
and in the regulation of cholesterol metabolism in Chinese 

This example illustrates the distribution of the amylin gene S20G mutation in Hong 
Kong Chinese with or without Type 2 diabetes, and its influences on (5-cell function and 
metabolic profiles. The data are consistent with the conclusion that the S20G mutation in 
the amylin gene may contribute to early occurrence of Type 2 diabetes, and that it may also 
influence lipid metabolism in the Chinese population. 
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Experimental Design and Methods 
Subjects 

The study protocol was approved by the Clinical Research Ethics Committee of the 
Chinese University of Hong Kong, Informed consent was obtained from each of the 
participants. For the study, 227 early- and 235 late-onset Type 2 diabetic patients (defined 
as age at diagnosis < 40 and > 40 years respectively), as well as 126 non-diabetic subjects 
(defined as fasting plasma glucose <6 mmol/1), were consecutively recruited at the Diabetes 
Centre of the Prince of Wales Hospital. Type 2 diabetes was asscertained according to the 
World Health Organisation criteria (Anonymous (1997) Diabetes Care 20:1183). None of 
the patients had typical presentations of Type 1 diabetes, such as acute symptoms and 
heavy ketonuria ( >3+), history of diabetic ketoacidosis or continuous need for insulin 
treatment within 1 year of diagnosis. Patients who had anti-glutamic acid decarboxylase 
autoantibody (Ko, ex al. (1998) Ann Clin Biochem 35:761) diabetes-causing mutations in 
the glucokinase and hepatonuclear factor-la and -4a genes (Ng et al. (1999) Diabetic 
Medicine 16:956) were excluded from the study. 

Clinical and biochemical measurements 

Patients fasted at least 8 hours prior to their clinical examinations. Blood pressures 
were taken, after they remained sitting for at least for 5 min using a standard mercury 
sphygmomanometer. Body height and weight, and waist and hip circumferences were taken 
while the patients were standing in light clothing but wearing no shoes. Measurements of 
fasting plasma glucose and lipids were performed by routine assays in the Department of 
Chemical Pathology at the Prince of Wales Hospital. Levels of total cholesterol and 
triglyceride were assayed enzymatically with commercial reagents (Centrichem, chemistry 
system, Baker Instrument Co., Allentown, PA). HDL-cholesterol was determined after 
fractional precipitation with dextran sulfate-MgCl 2 and LDL-cholesterol, calculated by the 
Friedewald's equation (Friedewald et aL (1972) Clin Chem 18:499). HbA lc was measured 
using an automated ion-exchange chromatographic method (BioRad, Hercules, CA, USA; 
normal range: 5.1-6.4%). Plasma levels of C-peptide were measured by 
radioimmunoassays using commercial kits (#7350104 and #141 respectively, Novo 
Nordisk, Denmark). The detection range was from 0.01 to 1.0 pmol/1. Insulin deficiency 
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was defined as fasting plasma C-peptide level <0.2 pmol/1 (Service et al. (1997) Diabetes 
Care 21:987). 

Mutation detection 

The S20G mutation creates a MSP I restriction fragment length polymorphism 
(RFLP), which can be detected using PCR-RFLP analysis (Sakagashira et al. (1996) 
Diabetes 45:1279). Briefly, DNA fragments spanning the mutation site were amplified by 
PCR using the primers 5 '-TC AC ATTTGTTCCATGTTAC-3 9 (SEQ ID NO:30) and 
5 '-C AATAACTAT AGAGTT ACATTG-3 9 (SEQ ID NO:31), at the annealing temperature 
of 56°C. Each of the PCR products was then digested overnight with 5 units of the 
restriction enzyme MSP I (#R6401, Promega, WI, USA) at 37 °C. Alleles were separated 
on 2.5% agarose gel. The wild-type allele showed a 239 bp product whereas the mutant 
allele showed 99 and 140 bp products. 

Statistical analysis 

Continuous variants were expressed as mean ± SD. Chi-square test was used for 
the analysis of proportions. Differences between continuous variables were analysed by the 
student's t-test using the statistical package for social sciences (SPSS Inc., Chicago, USA), 
a p value <0.05 was considered to be statistically significant. 

Results 

Table 7 summarises the demographic data of the subjects involved in the study. 
6 early- and 1 late-onset patients heterozygous for the amylin S20G mutation (2.6% vs 
0.4%, p =0.055) were identified. None of the non-diabetic subjects had the S20G mutation 
(Table 7). 

In the early-onset group, 5 out of the 6 mutation-carrying patients had satisfactory 
glycemic control with diet and/or oral drug medications, and had fasting plasma C-peptide 
concentrations of greater than 0.2 pmol/1 (Table 8). Moreover, the mutation carriers had 
lower total cholesterol (4.3 ± 0.9 vs 5.3 ± 1.1, p =0.02) and LDL-choIesterol (2.3 ± 0.7 vs 
3.2 ± 0.9, p=0.01) (Table 9) than those without the mutation. The patients with or without 
he S20G mutation were of a comparable age (34± 6 vs 35 ± 8, p>0.05). 
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The genetic association between the S20G mutation and early-onset Type 2 diabetes 
(Table 7; Sakagashira et al. (1996) supra) is consistent with the physiological data that 
amylin may play a role in the pathogenesis of the disease (Cooper (1994) Proc Natl Acad 
Sci 84:8628). Early onset of Type 2 diabetes is in fact not uncommon (Rosenbloom et aL 
(1999) Diabetes Care 22:345), although type 2 diabetes is classically a late-onset disease. 
However, early-onset patients appear to be heterogenous in etiology. In Hong Kong 
Chinese, in particular, maturity-onset Type 2 diabetes of the young as well as atypical 
autoimmune diabetes are present, but accounting for only a small proportion of the overall 
early-onset population (Ng et aL (1999) supra; Ko et aL (1998) supra). The S20G mutation 
may also explain some of the early-onset cases. 

The S20G mutation carriers usually did not require insulin for glycemic control, 
and did not appear to be insulin deficient (Table 8). These findings are different from 
previous observations that the S20G mutation may be associated with poor glycemic control 
as well as (5 cell dysfunction (Sakagashira et aL (1996) supra; Chuang et aL (1998) supra). 
The reported Japanese S20G carriers (Sakagashira et al. (1996) supra) had an average 
diabetes duration of approximately 20 years at the time they were tested. That they 
commonly required insulin treatment may be due to the deterioration in glycemic control 
during their long diabetes course, not necessary the presence of the mutation. 

Moreover, the mutation appears to be associated with lower plasma levels of total 
cholesterol and LDL-cholesterol (Table 9). This is in keeping with the recent finding that 
pramlintide (a synthetic human amylin analog) was able to lowers, plasma levels of total 
cholesterol and LDL-cholesterol in Type 2 diabetic patients (Thompson et al. (1998) 
Diabetes Care 21:987). Few studies to date have been focused on the relationships between 
amylin action and lipid profiles These data and those from Thompson and co-workers are 
consistent with the conclusion that amylin may also play a role in the regulation of 
cholesterol metabolism. 
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Example 4 

The significant roles of genetics and obesity in familial 
early-onset Type 2 diabetes in Chinese patients 



This example illustrates the prevalence of known molecular defects in separate 
cohorts of Chinese patients with early- and late-onset familial Type 2 diabetes. The genes 
studied are those that have been found to be associated with diabetes and which may 
contribute to early onset of the disease under gene-gene and gene-environmental influences, 
including glucokinase (MODY2), HNF-la (MODY3), and the A3243G mutation in the 
niitochondrial DNA coding for tRNA Uu(UUR) (mt3243) that has been associated with a form 
of diabetes characterized by maternal inheritance and deafness (van den Ouweland, et al 
(1992) Nature Genet 1:368). 

Experimental Design and Methods 
Subjects 

The Prince of Wales Hospital (PWH) is a regional teaching hospital in Hong Kong. 
Its catchment area has a population of 1.2 million, accounting for 20% of the total 
population in Hong Kong. There is a lack of long term health care programs in Hong 
Kong, and medical insurance is not widely available. Many patients with chronic diseases 
such as diabetes are managed in public hospitals or clinics where they pay only a nominal • 
fee. Hence, except for high social classes, the patients are largely representative of the 
diabetic population in Hong Kong. Since 1995, all patients attending the diabetes clinic of 
he PWH have been entered into the PWH Diabetes Registry after undergoing a structured 
issessment (Piwernetz, et al (1993) Diabetic Med 10:371; Chan, et al. (1997) Hosp Auth 
Qual Bull 2:3). During the study period, a separate cohort of 150 young patients with 
sarly-onset diabetes (age < 40 years and age at diagnosis < 35 years) who underwent the 
itructured assessment were recruited consecutively from the diabetes clinics at the PWH to 
brm the Young Chinese Diabetes Database (Ko, et al. (1998) Ann Clin Biochem 35:761). 
The 150 cases in the Young Chinese Diabetes Database, 92 and 53 patients, respectively, 
/ere selected for the present study as they satisfied the following criteria: All these 145 
oung patients had early-onset (current age and age at diagnosis < 40 years) Type 2 
iabetes (1985 WHO criteria, Geneva) and a positive family history for diabetes (at least 1 
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first degree relative with diabetes). Patients with classical Type 1 diabetes (acute ketotic 
presentation or continuous requirement of insulin within 1 year of diagnosis) were excluded 
from the study. 

The prevalence of anti-GAD (Ko, et al. (1998) supra), mt3243 (Smith, et al. (1997) 
Diabetic Med 14:1026; Ng, et al. (2000) 52:557) and amylin gene mutations (Lee, et al. 
(2001) /. Endocrinol) amongst patients from the Young Chinese Diabetes Database has 
been reported. Additionally, the prevalence of mt3243 (Ng, et aL (2000) supra), amylin 
(Lee, et al (2001) supra), glucokinase, HNF-ld and HNF-4a gene mutations (Ng, et al. 
(1999) Diabetic Med 16:956) in a separate cohort from the PWH Diabetes Registry has 
been reported. In this study, screening for glucokinase and HNF-lct gene mutations was 
extended to the 53 patients from the Young Chinese Diabetes Database. The HNF-4a gene 
was not screened in this cohort due to the expected low frequency of mutations, (None 
were found in the 92 patients from the PWH Diabetes Registry (Ng, et al. (1999) supra)). 
Screening for anti-GAD was extended to the 92 patients from the PWH Diabetes Registry. 

Nineteen (13%) of these 145 young patients with familial diabetes met the minimal 
criteria for MODY (age at diagnosis < 25 years and presence of diabetes in two 
consecutive generations). Altogether 10 out of 20 families with probands carrying putative 
diabetogenic gene mutations were recruited for a 75-gram OGTT and clinical assessment. 
The 1999 WHO classification was used to define the glycemic status of the family members 
(WHO, Geneva, 1999). For comparison of clinical characteristics of the early-onset 
patients, 290 sex-matched patients with late-onset diabetes (age at diagnosis > 40 years) 
and a positive family history of diabetes were randomly selected from the current 1800 
cases in the PWH Diabetes Registry. One hundred healthy Chinese (age 33 ± 10 years, 40 
males and 60 females) were selected as control subjects from hospital staff and students for 
screening for the gene variants identified in the study patients. Informed consent was 
obtained from each subject for a blood sample to be taken for DNA extraction and 
measurement of biochemical indices. This study was approved by the Clinical Research 
Ethics Committee of The Chinese University of Hong Kong. 
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Clinical studies 

All patients underwent a structured assessment based on the Europe DiabCare 
Protocol. They had documentation of their family history of diabetes, age at diagnosis and 
anthropometric indices (Piwernetz, et al (1993) supra; Chan, et al (1997) supra). Body 
mass index (BMI) was used as an index of general obesity. Waist circumference, which is 
highly correlated in Chinese with visceral fat accumulation measured by magnetic 
resonance imaging (Anderson, et al (1997) Diabetes Care 20: 1854), was used as an index 
of central obesity. After an overnight fast, venous blood was sampled for measurement of 
plasma glucose, insulin, HbA lc , total cholesterol (TC), HDL-C, LDL-C (calculated), 
triglyceride (TG) and anti-GAD. A morning spot urine sample was collected for assessment 
of albuminuria. Retinopathy and sensory neuropathy were assessed as previously described 
(Ko, et al (1999) J Diabetes Complications 13:300). 

General obesity was defined as a BMI > 25 kg/m 2 using the recent Asian criteria 
(WHO, Western Pacific Region, 2000). Albuminuria was defined as an albuminxreatinine 
ratio (ACR) > 3.5 mg/mmol in a spot urine sample (Schwab, et al (1992) Diabetes Care 
15:1581). The HOMA IR index (fasting plasma insulin x glucose / 22.5) derived from the 
HOMA equation was used to assess insulin resistance (Matthews, et al (1985) Didbetologia 
28:412). 

Biochemical assays 

Plasma glucose, HbA lc , lipids, urinary albumin and creatinine were measured by 
routine assays in the Department of Chemical Pathology at the PWH (see Chan, et al. 
(1996) Diabetic Med 13:150). Plasma insulin was measured in non-insulin treated patients 
by a radioimmunoassay (Pharmacia, Sweden) with intra- and inter-assay CVs of 6% and 
13.8%, respectively. Anti-GAD was measured by a radioimmunoprecipitation assay (Chen, 
et al (1993) Pediatr Res 34:785). The upper normal limit of 18 units, is applicable to 
Asian and European subjects (Tuomi, etal. (1995) Clin Immunol Immimopath 74:202, 
Chen, et al (1993) supra). 

Genetic Analysis 

The minimal promoter regions and exons of the glucokinase (f}-cell form), HNF-lct 
and HNF-4a (HNF4a2 form) genes were screened for mutations by direct sequencing of 
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PCR products (see Froguel, et al. (1993) N Engl J Med 328:697; Yamagata, et al. (1996) 
Nature 384:455; Yamagata, et al. (1996) Nature 384:458). One previously unreported 
mutation in HNF-la (Al 16V) and two previously unreported mutations in glucokinase 
(V101M and Q239R) were identified in this study. HNF-la A116V was screened by using 
the forward primer 5'-CATGCACAGTCCCCACCCTCA-3' (SEQ ID NO:34) and reverse 
primer 5'-TCCCACTGACTTCCTTTCC-3 , (SEQ ID NO:35) for PCR amplification 
followed by digestion with Hph\. The wild-type allele showed 44 and 397 bp products 
whereas the mutant allele showed 44, 136 and 261 bp products. Glucokinase V101M was 
screened by using the forward primer 5'-GTCCCTGAGGCTGACACACTT-3' (SEQ ID 
NO:24) and reverse primer 5'-AGCTGGGCCCTGAGATCCTGCA-3' (SEQ ID NO:25) 
for PCR amplification followed by digestion with Hsp91U. The wild-type allele showed 20, 
56 and 174 bp products whereas the mutant allele showed 20, 42, 56, and 132 bp products. 
Glucokinase Q239R was screened by using the forward primer 
5'-AGGAACCAGGCCCTACTCCG-3' (SEQ ID NO:36) and reverse primer 
5'-TACTCCAGCAGGAACTCGTCC-3' (SEQ ID NO:37) for PCR amplification followed 
by digestion with Acil. The wild-type allele showed 70 and 134 bp products whereas the 
mutant allele showed 33, 70 and 101 bp products. The occurrence of putative mutations in 
family members of the probands (Table 10) and control subjects was determined by PCR- 
RFLP. Mt3243A->G and amylin gene S20G mutations were determined by PCR-RFLP as 
described (Sakagashira, et al (1996) Diabetes 45:1279; Smith, et al. (1997) supra). 

Statistical analysis 

Normally distributed data are expressed as mean ± SD. Data with skewed 
distributions were normalised by logarithmic transformation. The resultant means were 
antilogarithmically transformed and expressed as geometric mean together with 25 and 75 
percentiles. Chi square test and Student's unpaired t tests were used for between-group 
comparisons. A p value < 0.05 (2-tailed) was considered to be significant. All statistical 
analyses were performed using the Statistical Package for Social Sciences (SPSS for 
Windows, version 9.0). 
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Results 

Prevalence of putative gene mutations and anti-gad in patients with familial 
early-onset diabetes 

Amongst the 145 patients with familial early-onset diabetes, there were 20 (14%) 
with putative mutations, involving the HNF-la gene, 7 (5%), glucokinase gene, 6 (4%), 
mt3243, 4 (3%), and amylin S20G, 3 (2%). Anti-GAD was positive in 6 (4%). No 
mutation in the HNF-4a gene was found in the 92 patients from the PWH Diabetes 
Registry. All mutations identified in the HNF-la and glucokinase genes were previously 
unreported (Table 10). The HNF-la G20R and glucokinase Q239R mutations were found 
in 4 unrelated patients. None of these mutations were found in 100 healthy control subjects. 

Family cosegregation study of gene mutations 

Amongst the 20 patients carrying putative gene mutations, 10 families were 
recruited for cosegregation study (Fig. 9). Cosegregation of a mutation with clinical 
diabetes or glucose intolerance were observed in 4 families: HK10 with HNF-la 
IVS2nt-lG->A, YDM142 with glucokinase V101M, HK84 with glucokinase I110T and 
HK50 with mt3243. Segregation was inconclusive in the other 6 families. For the families 
HK54 with HNF-la R203H, YDM83 with mt3243 and CX216 with amylin S20G 
mutations, only the probands, and none of the diabetic or non-diabetic family members 
who presented for screening, carried the gene mutations. For YDM67 with glucokinase 
Q239R, HK61 with mt3243 and YDM99 with amylin S20G mutations, the mutations were 
found in both diabetic and non-diabetic family members. Amongst the 3 families with 
glucokinase mutations, all mutation carriers from the families YDM142 and HK84 had 
higher fasting plasma glucose concentrations (5.8 - 8.9 mmol/1) than those with no 
mutation (4.2 - 5.3 mmol/1). On the other hand, the 4 mutation-carrying siblings of 
proband YDM67 had a normal fasting plasma glucose concentration (4.0 - 5.6 mmol/1) 
irrespective of their glycemic status. 
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Clinical characteristics of patients with familial early -onset diabetes of unknown cause 
compared with familial late-onset diabetes 

Although 26 of the patients with early-onset diabetes carried putative gene 
mutations associated with diabetes or the autoimmune indicator, anti-GAD antibodies, the 
causes of diabetes in the other 1 19 patientsremain to be determined. These young patients 
with diabetes of unknown cause (age at diagnosis 30 ± 6 years) differed clinically from the 
290 late-onset patients (age at diagnosis 52 ± 8 years) (Table 11). Thus, despite a positive 
family history of diabetes in all patients in both groups, those with early-onset diabetes 
more frequently had a father with diabetes (39% vs. 22%) and a mother with diabetes 
(63% vs. 41%), but less frequently a sibling with diabetes (30% vs. 53%) (p < 0.001). 
The early-onset patients had a higher BMI but lower BP and increased prevalence of 
retinopathy and neuropathy as compared to the late-onset patients. The early-onset patients 
had better glycemic control (glucose and HbA lc ) as well as higher fasting insulin 
concentrations than the late-onset patients. Notwithstanding similar mean disease duration 
of only 4 years, both the early- and late-onset patients had a disproportionately high 
prevalence of albuminuria, 40% and 38%, respectively, as compared with the prevalence 
rates of other microangiopathic complications. Insulin resistance, as assessed by the 
HOMA IR index, was similar between the two groups of non-insulin treated patients. The 
proportion of patients treated with insulin was similar in both groups (8% vs. 7%) but 
fewer patients with early-onset diabetes were treated with oral drugs (33% vs. 61%, p < 
0.001) as compared to the late-onset group. 

Clinical characteristics of the patients with familial early-onset diabetes of unknown cause 
and familial late-onset diabetes classified according to obesity index 

Due to the high prevalence of general obesity in both early-onset patients of 
unknown cause and late-onset patients (55% and 46%, respectively), the association of 
obesity with cardiovascular risk factors and complications in these patients was further 
analyzed (Table 11). Amongst the early-onset patients, the obese patients had worse 
glycemic control (HbA lc ) as well as a higher systolic BP, a more adverse lipid profile 
(higher TG, lower HDL-C and higher TC/HDL-C), and higher fasting insulin than the 
non-obese patients. They were also more insulin resistant (HOMA IR index) and had a 
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higher prevalence of retinopathy and albuminuria than the non-obese patients. Amongst the 
late-onset patients, the obese patients had better glycemic control (glucose and HbAiJ than 
the non-obese patients. However, they had a higher systolic and diastolic BP, and a higher 
fasting insulin than the non-obese patients. The degree of insulin resistance and prevalence 
of complications were similar in the two groups. 

Example 5 

An illustration of a Chinese family with hepatocyte nuclear factor- la diabetes (MODY3) 
that emphasizes the need for early diagnosis and appropriate treatment 

This example reports the clinical course of HNF-la diabetes/MODY 3 in a Chinese 
family with early-onset diabetes and severe complications (Fig. 10) (Chan, a al. (1990) 
Diabetic Medicine 7:211). This family highlights the importance of early diagnosis and 
prompt treatment in the improvement of clinical outcome even in genetically susceptible 
subjects. 

Three family members in the proband's family had severe diabetic complications 
when they were referred for treatment. The proband (UI-5), 19 years of age, had severe 
proliferative retinopathy, heavy proteinuria (1.4 g protein a day) and necrobiosis lipoidica. 
She had been diagnosed with Type 2 (non-insulin-dependent) diabetes mellitus 3 months 
earlier and was treated with glibenclamide. Retinal photocoagulation treatment was initiated 
and she was started on insulin and an ACE inhibitor. She subsequently developed 
hypertension and progressed to end-stage renal disease requiring dialysis by the age of 30 
years. Her mean HbA lc was 8.0% over the years. She is currently receiving 42 units of 
insulin. 

Her older sister (III-2) had a vitreous haemorrhage and had been treated with insulin 
since diagnosis at the age of 24 years. She became blind and had nephropathy (0.8 g 
protein a day) 2 years later. She is currently treated with insulin (16 units) and an ACE 
inhibitor, and has a mean HbA lc of 6.4%. 

The subject's mother (II-3) had a glycosuria complicated pregnancy when she was 
33 years old. She was diagnosed to have Type 2 diabetes at the age of 38 years and was 
then treated with glibenclamide for 10 years. At the time of the study she had proliferative 
retinopathy, nephropathy, peripheral neuropathy, necrobiosis lipoidica, hypertension and 
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cataracts. Insulin treatment (20 units) was commenced and her HbA lc was reduced from 
17.2% to 9.2% within 8 months. Two months later, she had a myocardial infarction 
followed by progressive deterioration of cardiac and renal functions. She died of 
pulmonary edema and septicaemia with a gangrenous foot at the age of 52 years. 

The fourth daughter (m-6) had been treated with insulin since her incidental 
diagnosis of diabetes at the age of 12 years after a nasal polypectomy. She is currently 
receiving 68 units of insulin, and has mean HbA lc of 8 .8 % . 

Two other family members underwent screening by OGTT. The second daughter 
(III-3) has fluctuated between having normal glucose tolerance and IGT over the last 11 
years. A brother (m-7) had overt diabetes on screening with an initial HbA lc of 10.5%. 
Insulin was started after 3 months of dietary treatment. He is currently receiving 26 units 
of insulin, withameanHbA lc of 5.3%. 

One maternal uncle (11-4) was diagnosed with diabetes and hyperlipidnemia with 
thirst and polyuria at the age of 39 years. He has been treated with oral drugs since 
diagnosis, and has mean HbA lc of 8.4%. His children were not available for detailed 
genetic testing and clinical assessment. The affected members 0-4, III-6 and IH-7 (Fig 10) 
have remained free of complications despite all having had diabetes for more than 10 years. 

The father was also diagnosed with IGT. He was non-obese and had 
hyperlipidaemia. 

Sequencing of the HNF-l a gene in this family showed a novel splice acceptor site 
mutation (AG-kAA) in intron 2 (IVS2nt-lG-»A) which cosegregated with diabetes (Fig. 10) 
(Ng, et al. (1999) Diabetic Medicine 16:956). This mutation is expected to produce a non- 
functional mRNA. All the diabetic members, including the maternal uncle, (H-4, m-2, EQ- 
5, m-6 and EH-7) were heterozygous for this mutation but the father (II-2) and the daughter 
(m-3) with IGT did not have the mutation. Thus it is very likely that the mother (II-3) for 
whom no DNA sample was available also carried this mutation. As with other patients with 
HNF-la diabetes (Byrne, et al (1996) Diabetes 45:1503), most affected family members 
exhibited defective pancreatic beta-cell function as assessed by the glucagon stimulation 
test. The mother and all the affected siblings, except subject DI-2, were insulin deficient 
based on a definition of post-glucagon (1 mg intravenously) stimulated plasma C peptide at 
6 min of less than 0.6 nmol/1 (0.24—0.55 nmol/1 respectively) (Service, et al (1997) 
Diabetes Care 20:198). The brother, m-7, who was diagnosed with diabetes by OGTT was 
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also insulin deficient. All the HNF-l a mutant carriers, except JI-4, required insulin 
treatment for glycemic control. 

Although all affected family members carried the same HNF-la gene mutation, 
their clinical courses have varied tremendously. Severe complications were present in those 
family members whose diagnosis was delayed and who presumably had poor glycemic 
control before diagnosis (II-3, m-2 and IH-5). Complications were, however, absent in the 
uncle (II-4) and the younger siblings (m-6 and HI-7) despite now having had diabetes for 
more than 10 years (Fig. 10), who were promptly diagnosed and received treatment. This 
is in accordance with a recent report suggesting that poor glycemic control is associated 
with a twofold to threefold increased risk among MODY3 patients of developing 
microalbuminuria and retinopathy, respectively (Isomaa, et al (1998) Diabetologia 
41:467). 

It is noteworthy that both maternal grandparents (1-3 and 1-4) were diagnosed with 
diabetes diagnosed in their late 50s. The effect of this bilineality on the natural course of 
HNF-la diabetes in this family is uncertain. It is, however, possible the non-MODY 
maternal grandparent transmitted a modifier gene affecting the age at onset or severity of 
the diabetes in carriers with the HNF-l a mutation. The age at diagnosis of diabetes in this 
family was increasingly younger with successive generations despite all carriers being 
relatively non-obese. This earlier diagnosis could be due to ascertainment bias or, more 
likely, an epiphenomenon due to increasing westernisation of the Hong Kong lifestyle with 
increased intake of high fat food and decreased physical activity (Chan and Cockram (1997) 
Diabetes Care 20:1785). This highlights the important influence of environment interacting 
with genetics in the natural course of HNF-la diabetes. In conclusion, this report 
emphasizes the need for early diagnosis by glucose tolerance testing or genetic screening, 
and appropriate treatment in patients who have a strong family history of diabetes, 
especially those with early onset disease and insulin deficiency. 

The data presented in Examples 1-5 demonstrate a combination of genetic mutations 
that are uniquely associated with the increased risk of a Chinese individual to develop type 
2 diabetes. The mutations are exemplified by, but are not limited to G20R, Al 16V, 
lVS2nt->GA, R203H, S432C and I618M of HNF-la; V101M, I110T, A119D, Q239R and 
3385V of glucokinase; S20G of amylin; and A3243G of mitochondrial tRNA Uu(UUR) . 
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Mutations correlative with a genetic predisposition of a Chinese individual to develop 
type 2 diabetes are efficiently identified in Chinese families with a positive family history 
of the disease, but find use in screening any Chinese individual that is asymptomatic but at 
risk of developing diabetes. Methods for identification of a combination of at least two 
genetic mutations correlative with type 2 diabetes in a Chinese individual offers an 
important tool for clinicians, not only to initiate prophylactic therapies before the onset of 
overt diabetic symptoms, but also to design therapies that are directed to the specific 
etiology of the disease in each individual. 

All publications and patent applications mentioned in this specification are indicative 
of the level of skill of those skilled in the art to which this invention pertains. All 
publications and patent applications are herein incorporated by reference to the same extent 
as if each individual publication or patent application was specifically and individually 
indicated to be incorporate by reference. 

The invention now having been fully described, it will be apparent to one of 
ordinary skill in the art that many changes and modifications can be made thereto without 
departing from the spirit or scope of the appended claims. 
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WHAT IS CLAIMED IS: 

1. A microchip comprising: 

a combination of at least two different mutant nucleic acid sequences of a 
wild-type nucleic acid sequence, wherein each wild-type nucleic acid sequence 
encodes a protein involved in insulin secretion, wherein said gene comprises at least 
one mutation indicative of a predisposition for type 2 diabetes in a member of a 
Chinese population. 

2. The microchip according to Claim 1, wherein said nucleic acid sequences comprise 
nucleic acid selected from the group consisting of genomic DNA, complementary 
DNA and messenger RNA. 

3. The microchip according to Claim 1, wherein said type 2 diabetes is maturity onset 
diabetes of the young, 

4. The microchip according to Claim 1, wherein said microchip further comprises a 
genetic marker that uniquely identifies a member of a Chinese population. 

5. A microchip comprising: 

a combination of at least two different nucleic acid sequences, wherein each 
nucleic acid sequence encodes a gene product involved in insulin secretion wherein 
said gene comprises at least one mutation indicative of a predisposition for type 2 
diabetes in a human subject of a Chinese population, wherein said gene product is 
selected from the group consisting of a glucokinase, a hepatocyte nuclear factor la, 
an amylin and a mitochondrial tRNA(Leu)(UUR). 

6. A microchip comprising: 

at least one each of a combination of different nucleic acid sequences, 
wherein each nucleic acid sequence encodes a protein selected from the group 
consisting of glucokinase, hepatocyte nuclear factor la, amylin and mitochondrial 
tRNA(Leu)(UUR), wherein said glucokinase gene comprises at least one mutation 
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selected from the group consisting of V101M, I110T, A119D, Q239R, and G385V, 
and said hepatocyte nuclear factor la gene comprises at least one mutation selected 
from the group consisting of G20R, A116V, IVS2nt-G->A, R203H, S432C, and 
I618M, and said amylin gene comprises the mutation S20G, and said mitochondrial 
tRNA(Leu)(UUR) gene comprises the mutation A3243G. 

7. A microchip comprising at least one nucleic acid sequence selected from the group 
consisting of SEQ ID NO:2, SEQ ID NO:7 and SEQ ID NO: 10. 

8. A microassay system comprising a microchip according to Claim 1, 5 or 6. 
9* A kit comprising a microchip according to Claim 1, 5 or 6. 

10. A nucleic acid primer comprised of SEQ ID NO:34. 

11. A nucleic acid primer comprised of SEQ ID NO:35. 

12. A nucleic acid primer comprised of SEQ ID NO:36. 

13. A nucleic acid primer comprised of SEQ ED NO:37. 

14. A nucleic acid probe that specifically anneals to a nucleic acid encoding a mutant 
gene of a wild-type gene involved in insulin secretion, wherein said mutant gene 
comprises at least one mutation indicative of increased risk for type 2 diabetes in a 
human subject of a Chinese population, and wherein said nucleic acid probe does 
not bind to said wild-type gene. 

15. An isolated nucleic acid encoding a mutant gene of a wild-type gene that encodes a 
protein involved in the secretion of insulin, wherein said mutant gene comprises at 
least one mutation associated with increased risk for type 2 diabetes in a subject of a 
Chinese population. 
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The isolated nucleic acid according to Claim 15, wherein said mutation is a single 
nucleotide polymorphism. 

The isolated nucleic acid according to Claim 15, wherein said mutation is selected 
from the group consisting of a missense, a nonsense, an insertion and a deletion 
mutation. 

18. The isolated nucleic acid accprding to Claim 15, wherein said wild-type gene 
encodes hepatocyte nuclear factor la, and said mutation is A116V. 

19. The isolated nucleic acid according to Claim 15, wherein said wild-type gene 
encodes glucokinase, and said mutation is selected from the group consisting of 
V101M and Q239R. 

20. An isolated nucleic acid encoding a mutant gene of a wild-type gene that encodes a 
protein involved in the secretion of insulin, wherein said mutant gene is selected 
from the group consisting of SEQ ID NO:2, SEQ ID NO:7 and SEQ ID NO: 10. 

21 . An isolated amino acid sequence encoded by a mutant gene of a wild-type gene 
encoding a protein involved in the secretion of insulin, wherein said mutant gene 
comprises at least on mutation associated with increased risk for type 2 diabetes in a 
member of a Chinese population. 

22. An antibody that specifically binds a protein encoded by a mutant gene of a wild- 
type gene encoding a protein involved in the secretion of insulin, wherein said 
mutant gene comprises at least on mutation associated with increased risk for type 2 
diabetes in a member of a Chinese population, and wherein said antibody does not 
bind to a protein encoded by said wild-type gene. 

23. A method of determining a genetic predisposition of a member of a Chinese 
population to develop type 2 diabetes, said method comprising the step of: 

contacting a sample comprising nucleic acid from said member with a 



16. 



17. 
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combination of at least two nucleic acid sequences, wherein each nucleic acid 
sequence encodes a mutant gene of a wild-type gene encoding a protein involved in 
insulin secretion, wherein each mutant gene comprises at least one mutation 
indicative of a predisposition of a member of a Chinese population to develop type 2 
diabetes, whereby identification of at least one of said mutations in said sample is 
indicative of a genetic predisposition for type 2 diabetes in said member of a 
Chinese population. 

24. A method for detecting an increased risk of an individual of a Chinese population 
with decreased insulin secretory function to develop type 2 diabetes, said method 
comprising the step of: 

contacting a sample comprising nucleic acid from said individual with a 
combination of at least two different nucleic acid sequences, wherein each nucleic 
acid sequence encodes a mutant gene of a wild-type gene encoding a protein 
involved in insulin secretion, wherein each mutant gene comprises at least one 
mutation indicative of a predisposition of a member of a Chinese population to 
develop type 2 diabetes, wherein identification of at least one of said mutations in 
said sample is indicative of an increased risk for type 2 diabetes in said individual of 
a Chinese population. 

25. The method according to Claim 23 or 24, wherein said combination of at least two 
different nucleic acid sequences are attached to a microchip. 

26. The method according to Claim 23 or 24, wherein said nucleic acid sample is 
obtained from bodily fluid or tissue. 

27. The method according to Claim 23 or 24, wherein said wild-type gene encodes a 
gene product selected from the group consisting of hepatocyte nuclear factor la, 
glucokinase, amylin and mitochondrial tRN A(Leu)(UUR) . 



28. 



A method of determining a genetic predisposition of a member of a Chinese 
population to develop type 2 diabetes, said method comprising the step of: 
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contacting a sample comprising nucleic acid from said member with a 
combination of at least two different nucleic acid sequences selected from thegroup 
consisting of G20R, A116V, IVS2nt-G->A, R203H, S432C, and I618M of 
hepatocyte nuclear factor la; V101M, I110T, A119D, Q239R, and G385V of 
glucokinase; S20G of amylin, and A3243G of mitochondrial tRNA(Leu)(UUR), 
wherein each nucleic acid sequence encodes a mutant gene of a wild-type gene 
encoding a protein involved in insulin secretion, wherein each mutant gene 
comprises at least one mutation indicative of a predisposition of a member of a 
Chinese population to develop type 2 diabetes, and wherein said identification of 
one of said mutations in said sample is indicative of a genetic predisposition for type 
2 diabetes in said member of a Chinese population. 

29. A method for detecting an increased risk of an individual of a Chinese population 
with decreased insulin secretory function to develop type 2 diabetes, said method 
comprising the step of: 

contacting a sample from said individual with a combination of at least two 
different nucleic acid sequences selected from the group consisting of G20R, 
A116V, IVS2nt-G-»A, R203H, S432C, and I618M of hepatocyte nuclear factor la; 
V101M, I110T, A119D, Q239R, and G385V of glucokinase; S20G of amylin, and 
A3243G of mitochondrial tRNA(Leu)(UUR), wherein each nucleic acid sequence 
encodes a mutant gene of a wild-type gene encoding a protein involved in insulin 
secretion, wherein each mutant gene comprises at least one mutation indicative of a 
predisposition of an individual of a Chinese population to develop type 2 diabetes, 
and wherein the identification of at least one of said mutations in said sample is 
indicative of an increased risk for type 2 diabetes in said individual of a Chinese 
population. 

30. A method for screening for genetic mutations in an individual of a Chinese 
population diagnosed with type 2 diabetes, said method comprising the steps of: 

contacting a sample from said individual with a combination of at least two 
different nucleic acid sequences, wherein each nucleic acid sequence encodes a 
mutant gene of a wild-type gene encoding a protein involved in insulin secretion, 
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wherein each mutant gene comprises at least one mutation indicative of a 
predisposition of a member of a Chinese population to develop type 2 diabetes, and 
wherein identification of at least one of said mutations in said sample is indicative 
of an etiology of said type 2 diabetes in said individual of a Chinese population. 

31 . The method according to Claim 30, wherein said individual has been diagnosed 
with maturity onset diabetes of the young. 

32. The method according to Claim 30, wherein said individual has at least one primary 
family member that has been diagnosed with maturity onset diabetes of the young. 

33. The method according to Claim 30, wherein said mutation is selected from the 
group consisting of a missense, a nonsense, an insertion and a deletion mutation. 

34. A method for screening for genetic mutations indicative of increased risk of an 
individual of a Chinese population to develop type 2 diabetes, said method 
comprising the steps of: 

contacting a sample from said individual with a combination of at least two 
different nucleic acid sequences selected from the group consisting of G20R, 
A116V, IVS2nt-G->A, R203H, S432C, and I618M of hepatocyte nuclear factor la; 
V101M, I110T, A119D, Q239R, and G385V of glucokinase; S20G of amylin, and 
A3243G of mitochondrial tRNA(Leu)(UUR), wherein each nucleic acid sequence 
encodes a mutant gene of a wild-type gene encoding a protein involved in insulin 
secretion, wherein each mutant gene comprises at least one mutation indicative, of a 
predisposition of an individual of a Chinese population to develop type 2 diabetes. 

35. A method for screening for a genetic predisposition to develop type 2 diabetes in an 
individual of a Chinese population having at least one primary family member that 
has been diagnosed with type 2 diabetes, said method comprising the steps of: 

contacting a sample comprising nucleic acid from said individual with a 
combination of at least two different nucleic acid sequences, wherein each nucleic 
acid sequence encodes a mutant gene of a wild-type gene encoding a protein 
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involved in insulin secretion, wherein each mutant gene comprises at least one 
mutation indicative of a predisposition of a member of a Chinese population to 
develop type 2 diabetes, and wherein identification of at least one of said mutations 
in said sample is indicative of a genetic predisposition to develop type 2 diabetes in 
said individual of a Chinese population. 
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Figure 1A 

U72612: HNF-la gene Exon 1 

1 tgccggccgg caggcaaacg 
61 gggccgcgtg gccctgtggc 

121 agctcctggc ggccctgctc 
181 gtgagccggg gccctacctc 
241 gcggcggtcg aggggagctg 
301 aggacgagac ggacgacgat 
361 acctcagccc tgaggaggcg 
421 gagccctgcc ccgtccccgc 
481 agcccccctt ctgagttgag 
541 gggacagggc ccatgagagc 
601 aactgctcct ctgcagagtc 
661 cnagggggac atttcccagg 
721 aatgcc 



caacccacgc ggtgggggag 
agccgagcca tggtttctaa 

a G20R 
gagtcagggc tgagcaaaga 
ctggctggag aaggccccct 
gctgagctgc ccaatgggct 
ggggaagact tcacgccacc 
gcccaccaga aagccgtggt 
tcccaggaga gcctagaggg 
tccccatgac cttcagcctt 
ccaggggtcc ttgcttggag 
ccaaatccca tgagcccagg 
gggtccaaga tgggagaaaa 



gcggctagcg tggtggaccc 
actgagccag ctgcagacgg 

ggcactgatc caggcactgg 
ggacaagggg gagtcctgcg 
gggggagact cggggctccg 
catcctcaaa gagctggaga 
ggagaccctt ctgcagtaag 
gcccccctca gctcctaacg 
tagcctagtt gctgggaagg 
gtttgagcct ccagcccctg 
cctttagccc agtccttggg 
agcagtgaat tcacaactca 



Figure 1B 

U72613: HNF-la gene Exon 2 

1 cacccaccca tccatccatc cgtccatcca 
61 catccatcca catatcttca tctgtgttgt 
121 atctgttcca gtgtctgtat ccataggcct 
181 caagtctctg tcctcatgac catgtgtctg 
241 cctcaccgtc cctgagtcta tgtgtaggcc 
301 tccccaccct cagagttgac aaggttccag 
361 gagacagccc ttgctgagca gatcccgtcc 
t A116V 

421 gtggcgaaga tggtcaagtc ctacctgcag 
481 gataccactg gcctcaacca gtcccacctg 
541 aagacgcaga agcgggccgc cctgtacacc 
601 cagcgtaagt aatgacccta ccccgcatct 
661 ctcataggtg ggggctggaa gcttcaccat 
721 aagtcagtgg gattcaacct gcatttatta 
781 ggagtanact tggtcctgaa catccaaaga 
841 tttttttaga ta 



cccattcatc cattcatcca ttcacccatc 

gtgtctgtgt atccatgttt ctaaaccttt 

gtgtccacgt ttgtcatgtg tgtgcgtcna 

tgtccctgtg tcctggcata aatgaccata 

cctgggctcc ataactgctt tcatgcacag 

cacccaggac cgcagcccca cctatgggga 

ttgccctctc ccagggagga cccgtggcgt 

cagcacaaca tcccacagcg ggaggtggtc 

tcccaacacc tcaacaaggg cactcccatg 

tggtacgtcc gcaagcagcg agaggtggcg 

tccctgggag ggcccaggac tctcccctaa 

ccccattaca cagacaggta gatggaaagg 

cctattctgc gccaggcact ctgtgggacg 

tgaatgaaat gggtccctgc tttctttttc 



Figure 1 
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U72614: HNF-la gene Exons 3 and 4 

1 cgtgactctg gaaaaatatg taagctctct 

61 gatagtaaat gtgccaaatc agaacaaatg 

121 aggatggtga gatcatatct tgggttggta 

181 ttgccttgaa cacaggttaa gaaagtgatg 

241 attagatgat ttc.taagttc tagctgtaag 

301 aagaatcaag ggcaaggtca ggggaatgga 

a IVS2nt 

361 ctcacggctt tctgtgcctg cagagttcac 

421 gcccacaggt gatgagctac caaccaagaa 

481 agcatcccag cagatcctgt tccaggccta 

541 gcgagagacg ctagtggagg agtgcaatag 

601 ttggtctggg ctgcggcaag gccaggggaa 

661 ctgtccagtt gccgagaact cctgatattg 

721 ttgaacctaa gcccattcct cgcagccccc 

781 atggatgctc acccaattcg attctctcta 

841 aatggagaga ctgaggtcag acagactgtc 

901 attggaaccc agatctgcca gcctcaaacc 

961 ccccttcatg cccaggacag ggttcctctg 

1021 ttctgcaggg cggaatgcat ccagagaggg 

1081 tccaacctcg tcacggaggt gcgtgtctac 

1141 gccttccggc acaagctggc catggacacg 

1201 ggacctgcgc tgcccgctca cagctcccct 

1261 aaggtccacg gtaagtggta tgtggggaca 

1321 gaggactgtc ccattgacag cagtcaccta 
1381 c 



gagcctcagc ttcttcatct gtacaatggg 
ctaatgctta cctgcagtct tgtactgaga 
ggaaagcatt cagggattga ttagtgatgt 
gcatgtgtgc tgtgtgtttg tcatcagtag 
ctcctctggt tcagcgccat ggcaatgaga 
cgagggaagg tgagagtggc cagtacccca 

ccatgcaggg cagggagggc tgattgaaga 
a R203H 

ggggcggagg aaccgtttca agtggggccc 
tgagaggcag aagaacccta gcaaggagga 
gtacaacggc gggcgggaaa cagtgctggt 
ggggaaggtg actctaggtc ctgtaaaagg 
gcttagcctg gcccagaaaa ttgagaatac 
ctgcaccntg gacaccaagc aaccccttcc 
caatcctatg gctcttttgc tcactttatg 
aattgcccaa ggtcacacag cagacctggc 
ctccggcaga gntcagcttc tcagaaccct 
agcctggcct ggaggctcat gggtggctat 
gtgtccccat cacaggcaca ggggctgggc 
aactggtttg ccaaccggcg caaagaagaa 
tacagcgggc cccccccagg gccaggcccg 
ggcctgcctc cacctgccct ctcccccagt 
agggacacgt gggaaggtgg gagggttggg 
aacctctttg cacgtcagtt tggttccatt 



Figure 2 



WO 02/072875 



3/13 



PCT/CN02/00158 



Figure 3A 

U72615: HNF-la gene Exons 5 and 6 

1 gcagctgacc cagggattgg caaaaggtag 
61 ggcagacagg cagatggcct aagcaaacca 
121 gcaggggagg gcagggaagt ggggtgctga 
181 cgctatggac agcctgcgac cagtgagact 
241 ttagtgacag tgtctacacc cctccaccaa 
301 agcctgctga gtacagaagc caagctggtg 
361 tcatctttcc ttggcaggga gattctggag 
421 ggccccccgg acacagcttg gcttcccctc 
481 ccctgtcagc accctgacag cactgcacag 
541 gcagccccag aacctcatca tggcctcact 

601 gcctgcctcc ctgggtccta cgttcaccaa 
661 gctggtgggg atgggtgggc acctgggtgg 
721 agctggaaaa gccactggga ctcattcatt 



aaacaaaggc agatttgctg gctgcataaa 

atggagtttg aagtgctgag ggctgtggag 

ggcaggacac tgcttccctc tccaggtgtg 

gcagaagtac cctcaagcag cggcggtccc 

gtgtccccca cgggcctgga gcccagccac 

agtgtccttg cttgtaagga aaacccaacc 

cagtccctag ggaggccctg tggggacccc 

gtaggtctca gcagctgggg gccccctccc 

cttggagcag acatccccag gcctcaacca 

tcctggggtc atgaccatcg ggcctggtga 
g S432C 

cacaggtgcc tccaccctgg tcatcggtaa 

gaggctcatg gggcaaccgc anaatccagg 

cattcattca ttcatacaac atgt 



Figure 3B 

U72618: HNF-la gene Exon 10 

1 tccagtgttc acagtaagat gtactcaggc 
61 ggaggctccc tttgttaaga accgagggta 
121 ctgtgatcca ggaggtgtgg ccctgcctcc 
181 tggggtgggt gtgggtgcct ggtgggtggc 
241 ccagcagcct ggtgctgtac cagagctcag 

g I618M 

101 catccaacca cagcgtcatc gagaccttca 
>1 aaccacggca cctgggccct ggggcctgta 



cagtccatgg gcggccgtgg accctggctg 

gaggtgtgac tttggggttc ctgttatgtg 

ccatcctgag tacccctagg gacaggcagg 

tagcagcctt gtttgcctct gcagtgtcct 

actccagcaa tggccagagc cacctgctgc 

tctccaccca gatggcctct tcctcccagt 
ctgcctgctt ggggggt 



Figure 3 
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Figure 4A 



AF041016: Glucokinase gene Exon 3 

1 tcccttgtgc cttccctcct cctctttgta 
61 agcccaaggc cagcctgtgg gtgtccctga 
121 agtcggggac ttcctctccc tggacctggg 

181 ggtgggagaa ggtgaggagg ggcagtggag 

a All 

241 ccccgaggac gccatgaccg gcactgctga 
301 gggcaaggca tgcaggatct cagggcccag 
361 ccaggggtgg ggagagcggg ggctggagga 
421 gggtgaacag gact 



atatccggct cagtcacctg gggcccaccc 
ggctgacaca cttctctctg tgcctttaga 
tggcactaac ttcagggtga tgctggtgaa 
a V101M I110T c 

cgtgaagacc aaacaccaga tgtactccat 
9D 

gatggtgagc agcgcagggg ccggggcagg 
ctagtcctga cgggaggtgc cacctgtcta 
ccacccagcc tcagaggcag ctggaggcct 



Figure 4B 



AF041019: Glucokinase gene Exon 7 

1 ggcaggaacc aggccctact ccggggcagt gcagctctcg 



61 caccccaggc 

121 ggagggggac 

181 cgagctggac 

241 cggtcagcag 

301 cagcggcaga 

361 gaaaagcaaa 

421 cctgtgagag 

481 ccgacaggtg 



acgggctgca 
gagggccgca 
gagttcctgc 
ctgtaaggat 
tgggagccgg 
ctgacaatcc 
atgaagggtt 
at 



atgcctgcta 
tgtgcgtcaa 
tggagtatga 
gcccccctcc 
gccattgcag 
gttcgtaagc 
agtcccagtt 



catggaggag 
taccgagtgg 
ccgcctggtg 
cccacaaccc 
ataatgggct 
tccatccctt 
gtgatgtgat 



ctgacagtcc ccccgacctc 

g Q239R 
atgcagaatg tggagctggt 
ggcgccttcg gggactccgg 
gacgagagct ctgcaaaccc 
aggccctggg cgctctggtg 
tgtttttaaa caactctggg 
ctgctcagtc atgacctgcc 
aagcccagac ctctttcctt 



Figure 4G 



AF041021: Glucokinase gene Exon 9 

1 gctgggggac ggctggccgg ggcccctccc 
61 atggactgtc ggagcgacac tcagcgaccg 
121 ggcgaccgca agcagatcta caacatcctg 
181 gactgcgaca tcgtgcgccg cgcctgcgag 
t G385V 

241 tcggcggggc tggcgggcgt catcaaccgc 
•301 cgcatcactg tgggcgtgga tggctccgtg 
361 gctctctccc tggtaaagtg gggcccaaaa 
421 caagctccaa gatttcgtag tcctcttctc 
481 tcgactgcgt gcagggcgcc cggtaatgaa 



tggagaacga gaggccgccg ctggaggggg 

ccctacctcc tcccgccccg cagcgacacg 

agcacgctgg ggctgcgacc ctcgaccacc 

agcgtgtcta cgcgcgctgc gcacatgtgc 

atgcgcgaga gccgcagcga ggacgtaatg 
tacaagctgc accccaggtg agcctgcccc 
agcgcgcgct ccaaggttcc ttgcggttcc 
gtcccccttg gcctagattt gggggaaggg 
tgtggaggat gaggtgggag ga 



Figure 4 
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X52819: Amylin gene Exon 3 

1 tgtcaaaaaa tctcagccat 
61 aaattgtttt ccttttgggg 
121 tggatccagc taaaattcta 
181 tcaggtggaa aagcggaaat 

g S20G 

241 tttagttcat tccagcaaca 
301 tacatatggc aagaggaatg 
361 cctttagagg . acaatgtaac 
421 atttaacagt gcccttttca 
481 tgctaggaca tataccttct 
541 aataaaaaga tagtatcttt 
601 ataagaacgt cattttggga 
661 aaggtagttt gaaccttggt 
721 gaaaatcagt aattggacca 
781 gccgaggcag gcagatcaca 
841 ccctgtctct actaaaaata 
901 agctactcag gaggctaagg 
961 agccgagatt gcaccactgc 
1021 aagaaagaaa attagtaatt 
1081 cctgttaagc aattcctttt 
1141 caaagaaata aaatcctgct 
1201 tgtgcttgct ggtactaaga 
1261 tttcattgtg tgttagcagc 
1321 aagtggcttt cagcaaacct 
1381 ttctattaat cgtgtcttca 
1441 aagattattc tttgttaaca 



ctagggtgtt tgcaaccaaa 
tttttcaatc caattacaag 
aggctctaac ttttcacact 
gcaacactgc cacatgtgca 



cactgagtta cttatgtgaa 
aatatttgat gtcacatggc 
ttgttccatg ttaccagtca 
acgcagcgcc tggcaaattt 



actttggtgc 
cagtagaggt 
tctatagtta 
tctccagtgt 
caaaagattg 
taaaatgaaa 
cctatatctc 
aaattgtaaa 
ggcgcggtgg 
aggtcaggag 
caaaaattag 
caggagaatc 
actccagcct 
gtaagtaccc 
tgcagtatat 
cctgactcgg 
ggctatttaa 
agtgagcttc 
cagtcatatt 
attaaaagac 
aattagacat 



cattctctca 
tttaaagaga 
ttgttttatg 
gaatatatgg 
ttttatatgt 
tgtttttgct 
agtggcacag 
cagctaataa 
ctcttgcctg 
ttcgagacca 
ccgggggtgg 
gcttaaaccc 
gggtggcaga 
ctgataagca 
ttctgaaatg 
tcaaaatatt 
aagtataaaa 
tattaaatgt 
cttatgcagg 
cacagacttc 
tctagcaaag 



tctaccaacg 
gagccactga 
ttctagtgat 
tctgtgtgtc 
agtactaact 
atagatttgt 
gtttaagaac 
tgaagttatt 
taatcccagc 
gcctgaccaa 
tgacatgtgc 
aggaggcgga 
gtgagactcg 
aattagtaat 
acagaatgct 
ttttaaagtc 
ctgctttgta 
atatgtcatt 
gtattgcgaa 
tggaaactct 
t 



tgggatccaa 
attacttgcc 
ttcctgtata 
tgatgtttgt 
aaggtcccat 
attttaaaac 
gaaggagaaa 
cttgacatga 
actttgggag 
catggtgaaa 
ctgtaatccc 
ggttgcagtg 
tctcaaaaaa 
tgtcaatacc 
gttttaaaaa 
tattgtttgt 
tccatgaggg 
tattttgttt 
acaacttgtg 
ttgctgtata 



Figure 5 
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J01415 Human mitochondrion, complete genome 

3001 ggacatcccg atggtgcagc cgctattaaa ggttcgtttg ttcaacgatt aaagtcctac 

3061 gtgatctgag ttcagaccgg agtaatccag gtcggtttct atctaccttc aaattcctcc 

3121 ctgtacgaaa ggacaagaga aataaggcct acttcacaaa gcgccttccc ccgtaaatga 

3181 tatcatctca acttagtatt atacccacac ccacccaaga acagggtttg ttaagatggc 
g 3243 

3241 agagcccggt aatcgcataa aacttaaaac tttacacftca gaggttcaat tcctcttctt 

3301 aacaacatac ccatggccaa cctcctactc ctcattgtac ccattctaat cgcaatggca 

3361 ttcctaatgc ttaccgaacg aaaaattcta ggctatatac aactacgcaa aggccccaac 

34 21 gttgtzaggcc cctacgggct actacaaccc ttcgctgacg ccataaaact cttcaccaaa 



Figure 6 
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<222> 
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<223> 
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<400> 1 

tgccggccgg caggcaaacg caacccacgc ggtgggggag gcggctagcg tggtggaccc 
gggccgcgtg gccctgtggc agccgagcca tggtttctaa actgagccag ctgcagacgg 
agctcctggc ggccctgctc gagtcaaggc tgagcaaaga ggcactgatc caggcactgg 
gtgagccggg gccctacctc ctggctggag aaggccccct ggacaagggg gagtcctgcg 
gcggcggtcg aggggagctg gctgagctgc ccaatgggct gggggagact cggggctccg 
aggacgagac ggacgacgat ggggaagact tcacgccacc catcctcaaa gagctggaga 
acctcagccc tgaggaggcg gcccaccaga aagccgtggt ggagaccctt ctgcagtaag 
gagccctgcc ccgtccccgc tcccaggaga gcctagaggg gcccccctca gctcctaacg 
agcccccctt ctgagttgag tccccatgac cttcagcctt tagcctagtt gctgggaagg 
gggacagggc ccatgagagc ccaggggtcc ttgcttggag gtttgagcct ccagcccctg 
aactgctcct ctgcagagtc ccaaatccca tgagcccagg cctttagccc agtccttggg 
cnagggggac atttcccagg gggtccaaga tgggagaaaa agcagtgaat tcacaactca 
aatgcc 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
726 
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<211> 


852 


<212> 


DNA 


<213> 


human 


<220> 




<221> 


Unsure 


<222> 


(1) (I 


<223> 


n ss g, 


<400> 


2 



cacccaccca tccatccatc cgtccatcca cccattcatc cattcatcca ttcacccatc 60 

catccatcca catatcttca tctgtgttgt gtgtctgtgt atccatgttt ctaaaccttt 120 

atctgttcca gtgtctgtat ccataggcct gtgtccacgt ttgtcatgtg tgtgcgtcna 180 

caagtctctg tcctcatgac catgtgtctg tgtccctgtg tcctggcata aatgaccata 240 

cctcaccgtc cctgagtcta tgtgtaggcc cctgggctcc ataactgctt tcatgcacag 300 

tccccaccct cagagttgac aaggttccag cacccaggac cgcagcccca cctatgggga 360 

gagacagccc ttgctgagca gatcccgtcc ttgccctctc ccagggagga cccgtggcgt 420 

gtggtgaaga tggtcaagtc ctacctgcag cagcacaaca tcccacagcg ggaggtggtc 480 

gataccactg gcctcaacca gtcccacctg tcccaacacc tcaacaaggg cactcccatg 540 

aagacgcaga agcgggccgc cctgtacacc tggtacgtcc gcaagcagcg agaggtggcg 600 

cagcgtaagt aatgacccta ccccgcatct tccctgggag ggcccaggac tctcccctaa 660 

ctcataggtg ggggctggaa gcttcaccat ccccattaca cagacaggta gatggaaagg 720 

aagtcagtgg gattcaacct gcatttatta cctattctgc gccaggcact ctgtgggacg 780 

ggagtanact tggtcctgaa catccaaaga tgaatgaaat gggtccctgc tttctttttc 840 

tttttttaga ta 852 



<210> 


3 


<211> 


1381 


<212> 


DNA 


<213> 


human 


<220> 




<221> 


Unsure 


<222> 


(1) - . C 


<223> 


n = g, 


<400> 


3 



cgtgactctg gaaaaatatg taagctctct gagcctcagc ttcttcatct gtacaatggg 60 



gatagtaaat gtgccaaatc agaacaaatg ctaatgctta cctgcagtct tgtactgaga 



120 
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aggatggtga gatcatatct tgggttggta ggaaagcatt cagggattga ttagtgatgt 180 

ttgccttgaa cacaggttaa gaaagtgatg gcatgtgtgc tgtgtgtttg tcatcagtag 240 " 

attagatgat ttctaagttc tagctgtaag ctcctctggt tcagcgccat ggcaatgaga 300 

aagaatcaag ggcaaggtca ggggaatgga cgagggaagg tgagagtggc cagtacccca 360 

ctcacggctt tctgtgcctg caaagttcac ccatgcaggg cagggagggc tgattgaaga 420 

gcccacaggt gatgagctac caaccaagaa ggggcggagg aaccgtttca agtggggccc 480 

agcatcccag cagatcctgt tccaggccta tgagaggcag aagaacccta gcaaggagga 540 

gcgagagacg ctagtggagg agtgcaatag gtacaacggc gggcgggaaa cagtgctggt 600 

ttggtctggg ctgcggcaag gccaggggaa ggggaaggtg actctaggtc ctgtaaaagg 660 

ctgtccagtt gccgagaact cctgatattg gcttagcctg gcccagaaaa ttgagaatac 720 

ttgaacctaa gcccattcct cgcagccccc ctgcaccntg gacaccaagc aaccccttcc 780 

atggatgctc acccaattcg attctctcta caatcctatg gctcttttgc tcactttatg 840 

aatggagaga ctgaggtcag acagactgtc aattgcccaa ggtcacacag cagacctggc 900 

attggaaccc agatctgcca gcctcaaacc ctccggcaga gntcagcttc tcagaaccct 960 

ccccttcatg cccaggacag ggttcctctg agcctggcct ggaggctcat gggtggctat 1020 

ttctgcaggg cggaatgcat ccagagaggg gtgtccccat cacaggcaca ggggctgggc 1080 

tccaacctcg tcacggaggt gcgtgtctac aactggtttg ccaaccggcg caaagaagaa 1140 

gccttccggc acaagctggc catggacacg tacagcgggc cccccccagg gccaggcccg 1200 

ggacctgcgc tgcccgctca cagctcccct ggcctgcctc cacctgccct ctcccccagt 1260 

aaggtccacg gtaagtggta tgtggggaca agggacacgt gggaaggtgg gagggttggg 1320 

gaggactgtc ccattgacag cagtcaccta aacctctttg cacgtcagtt tggttccatt 1380 

c 1381 

<210> 4 
<211> 1381 
<212> DNA 
<213> human 

<220> 

<221> Unsure 
<222> (1) . . (1381) 
<223> n = g, a, t or c 



WO 02/072875 PCT/CN02/00158 

4/13 

<400> 4 

cgtgactctg gaaaaatatg taagctctct gagcctcagc ttcttcatct gtacaatggg 60 

gatagtaaat gtgccaaatc agaacaaatg ctaatgctta cctgcagtct tgtactgaga 120 

aggatggtga gatcatatct tgggttggta ggaaagcatt cagggattga ttagtgatgt 180 

ttgccttgaa cacaggttaa gaaagtgatg gcatgtgtgc tgtgtgtttg tcatcagtag 240 

attagatgat ttctaagttc tagctgtaag ctcctctggt tcagcgccat ggcaatgaga 300 

aagaatcaag ggcaaggtca ggggaatgga cgagggaagg tgagagtggc cagtacccca 360 

ctcacggctt tctgtgcctg cagagttcac ccatgcaggg cagggagggc tgattgaaga 420 

gcccacaggt gatgagctac caaccaagaa ggggcggagg aaccatttca agtggggccc 480 

agcatcccag cagatcctgt tccaggccta tgagaggcag aagaacccta gcaaggagga 540 

gcgagagacg ctagtggagg agtgcaatag gtacaacggc gggcgggaaa cagtgctggt 600 

ttggtctggg ctgcggcaag gccaggggaa ggggaaggtg actctaggtc ctgtaaaagg 660 

ctgtccagtt gccgagaact cctgatattg gcttagcctg gcccagaaaa ttgagaatac 720 

ttgaacctaa gcccattcct cgcagccccc ctgcaccntg gacaccaagc aaccccttcc 780 

atggatgctc acccaattcg attctctcta caatcctatg gctcttttgc tcactttatg 840 

aatggagaga ctgaggtcag acagactgtc aattgcccaa ggtcacacag cagacctggc 900 

attggaaccc agatctgcca gcctcaaacc ctccggcaga gntcagcttc tcagaaccct 960 

ccccttcatg cccaggacag ggttcctctg agcctggcct ggaggctcat gggtggctat 1020 

ttctgcaggg cggaatgcat ccagagaggg gtgtccccat cacaggcaca ggggctgggc 1080 

tccaacctcg tcacggaggt gcgtgtctac aactggtttg ccaaccggcg caaagaagaa 1140 

gccttccggc acaagctggc catggacacg tacagcgggc cccccccagg gccaggcccg 1200 

ggacctgcgc tgcccgctca cagctcccct ggcctgcctc cacctgccct ctcccccagt 1260 

aaggtccacg gtaagtggta tgtggggaca agggacacgt gggaaggtgg gagggttggg 1320 

gaggactgtc ccattgacag cagtcaccta aacctctttg cacgtcagtt tggttccatt 1380 

c 1381 

<210> 5 
<211> 774 
<212> DNA 
<213> human 



<220> 

<221> Unsure 



WO 02/072875 PCT/CN02/00158 

5/13 



<222> (1) . 
<223> n = 


. (774) 

g, a, t or 


c 








<400> 5 
gcagctgacc 


cagggattgg 


caaaaggtag 


aaacaaaggc 


agatttgctg gctgcataaa 


60 


ggcagacagg 


cagatggcct 


aagcaaacca 


atggagtttg 


aagtgctgag ggctgtggag 


120 


gcaggggagg 


gcagggaagt 


ggggtgctga 


ggcaggacac 


tgcttccctc tccaggtgtg 


180 


cgctatggac 


agcctgcgac 


cagtgagact 


gcagaagtac 


cctcaagcag cggcggtccc 


240 


ttagtgacag 


tgtctacacc 


cctccaccaa 


gtgtccccca 


cgggcctgga gcccagccac 


300 


agcctgctga 


gtacagaagc 


caagctggtg 


agtgtccttg 


cttgtaagga aaacccaacc 


360 


tcatctttcc 


ttggcaggga 


gattctggag 


cagtccctag 


ggaggccctg tggggacccc . 


420 


ggccccccgg 


acacagcttg 


gcttcccctc 


gtaggtctca 


gcagctgggg gccccctccc 


480 


ccctgtcagc 


accctgacag 


cactgcacag 


cttggagcag 


acatccccag gcctcaacca 


540 


gcagccccag 


aacctcatca 


tggcctcact 


tcctggggtc 


atgaccatcg ggcctggtga 


600 


gcctgcctcc 


ctgggtccta 


cgttcaccaa 


cacaggtgcc 


tgcaccctgg tcatcggtaa 


660 


gctggtgggg 


atgggtgggc 


acctgggtgg 


gaggctcatg 


gggcaaccgc anaatccagg 


720 


agctggaaaa 


gccactggga 


ctcattcatt 


cattcattca 


ttcatacaac atgt 


774 


<210> 6 
<211> 407 
<212> DNA 
<213> human 










<400> 6 
tccagtgttc 


acagtaagat 


gtactcaggc 


cagtccatgg 


gcggccgtgg accctggctg 


60 


ggaggctccc tttgttaaga 


accgagggta 


gaggtgtgac 


tttggggttc ctgttatgtg 


120 


ctgtgatcca ggaggtgtgg 


ccctgcctcc 


ccatcctgag 


tacccctagg gacaggcagg 


180 


tggggtgggt 


gtgggtgcct 


ggtgggtggc 


tagcagcctt 


gtttgcctct gcagtgtcct 


240 


ccagcagcct ggtgctgtac 


cagagctcag 


actccagcaa 


tggccagagc cacctgctgc 


300 


catccaacca 


cagcgtcatg 


gagaccttca 


tctccaccca 


gatggcctct tcctcccagt 


360 


aaccacggca 


cctgggccct 


ggggcctgta 


ctgcctgctt 


ggggggt 


407 



<210> 7 

<211> 434 

<212> DNA 

<213> human 



WO 02/072875 



6/13 



PCT/CN02/00158 



<400> 7 

tcccttgtgc cttccctcct cctctttgta atatccggct cagtcacctg gggcccaccc 60 

agcccaaggc cagcctgtgg gtgtccctga ggctgacaca cttctctctg tgcctttaga 120 

agtcggggac ttcctctccc tggacctggg tggcactaac ttcagggtga tgctggtgaa 180 

ggtgggagaa ggtgaggagg ggcagtggag catgaagacc aaacaccaga tgtactccat 240 

ccccgaggac gccatgaccg gcactgctga gatggtgagc agcgcagggg ccggggcagg 300 

gggcaaggca tgcaggatct cagggcccag ctagtcctga cgggaggtgc cacctgtcta 360 

ccaggggtgg ggagagcggg ggctggagga ccacccagcc tcagaggcag ctggaggcct 420 

gggtgaacag gact 434 

<210> 8 

<211> 434 

<212> DNA 

<213> human 

<400> 8 

ticccttgtgc cttccctcct cctctttgta atatccggct cagtcacctg gggcccaccc 60 

agcccaaggc cagcctgtgg gtgtccctga ggctgacaca cttctctctg tgcctttaga 120 

agtcggggac ttcctctccc tggacctggg tggcactaac ttcagggtga tgctggtgaa 180 

ggtgggagaa ggtgaggagg ggcagtggag cgtgaagacc aaacaccaga tgtactccac 240 

ccccgaggac gccatgaccg gcactgctga gatggtgagc agcgcagggg ccggggcagg 300 

gggcaaggca tgcaggatct cagggcccag ctagtcctga cgggaggtgc cacctgtcta 360 

ccaggggtgg ggagagcggg ggctggagga ccacccagcc tcagaggcag ctggaggcct 420 

gggtgaacag gact 434 

<210> 9 

<211> 434 

<212> DNA 

<213> human 

<400> 9 

tcccttgtgc cttccctcct cctctttgta atatccggct cagtcacctg gggcccaccc 60 

agcccaaggc cagcctgtgg gtgtccctga ggctgacaca cttctctctg tgcctttaga 120 

agtcggggac ttcctctccc tggacctggg tggcactaac ttcagggtga tgctggtgaa 180 

ggtgggagaa ggtgaggagg ggcagtggag cgtgaagacc aaacaccaga tgtactccat 240 

ccccgaggac gccatgaccg gcactgatga gatggtgagc agcgcagggg ccggggcagg 300 
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gggcaaggca tgcaggatct cagggcccag ctagtcctga cgggaggtgc cacctgtcta 360 
ccaggggtgg ggagagcggg ggctggagga ccacccagcc tcagaggcag ctggaggcct 420 
gggtgaacag gact 434 

<210> 10 

<211> 492 

<212> DNA 

<213> human 

<400> 10 



ggcaggaacc aggccctact 


ccggggcagt 


gcagctctcg 


ctgacagtcc 


ccccgacctc 


60 


caccccaggc acgggctgca 


atgcctgcta 


catggaggag 


atgcggaatg 


tggagctggt 


120 


ggagggggac gagggccgca 


tgtgcgtcaa 


taccgagtgg 


ggcgccttcg 


gggactccgg 


180 


cgagctggac gagttcctgc 


tggagtatga 


ccgcctggtg 


gacgagagct 


ctgcaaaccc 


240 


cggtcagcag ctgtaaggat 


gcccccctcc 


cccacaaccc 


aggccctggg 


cgctctggtg 


300 


cagcggcaga tgggagccgg 


gccattgcag 


ataatgggct 


tgtttttaaa 


caactctggg 


360 


gaaaagcaaa ctgacaatcc 


gttcgtaagc 


tccatccctt 


ctgctcagtc 


atgacctgcc 


420 


cctgtgagag atgaagggtt 


agtcccagtt 


gtgatgtgat 


aagcccagac 


ctctttcctt 


480 


ccgacaggtg at 










492 


<210> 11 
<211> 532 
<212> DNA 
<2 13 > human 












<400> 11 

gctgggggac ggctggccgg 


ggcccctccc 


tggagaacga 


gaggccgccg 


ctggaggggg 


60 


atggactgtc ggagcgacac 


tcagcgaccg 


ccctacctcc 


tcccgccccg 


cagcgacacg 


120 


ggcgaccgca agcagatcta 


caacatcctg 


agcacgctgg 


ggctgcgacc 


ctcgaccacc 


180 


gactgcgaca tcgtgcgccg 


cgcctgcgag 


agcgtgtcta 


cgcgcgctgc 


gcacatgtgc 


240 


tcggcggtgc tggcgggcgt 


catcaaccgc 


atgcgcgaga 


gccgcagcga 


ggacgtaatg 


300 


cgcatcactg tgggcgtgga 


tggctccgtg 


tacaagctgc 


accccaggtg 


agcctgcccc 


360 


gctctctccc tggtaaagtg 


gggcccaaaa 


agcgcgcgct 


ccaaggttcc 


ttgcggttcc 


420 


caagctccaa gatttcgtag 


tcctcttctc 


gtccbccttg 


gcctagattt 


gggggaaggg 


480 


tcgactgcgt gcagggcgcc 


cggtaatgaa 


tgtggaggat 


gaggtgggag 


ga 


532 
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<210> 12 
<211> 1481 
<212> DNA 
<213> human 










<400> 12 
tgtcaaaaaa 


tctcagccat 


ctagggtgtt 


tgcaaccaaa cactgagtta 


cttatgtgaa 


60 


aaattgtttt 


ccttttgggg 


tttttcaatc 


caattacaag aatatttgat 


gtcacatggc 


120 


tggatccagc 


taaaattcta 


aggctctaac 


ttttcacact ttgttccatg 


ttaccagtca 


180 


tcaggtggaa 


aagcggaaat 


gcaacactgc 


cacatgtgca acgcagcgcc 


tggcaaattt 


240 


tttagttcat 


tccggcaaca 


actttggtgc 


cattctctca tctaccaacg 


tgggatccaa 


300 


tacatatggc 


aagaggaatg 


cagtagaggt 


tttaaagaga gagccactga 


attacttgcc 


360 


cctttagagg 


acaatgtaac 


tctatagtta 


ttgttttatg ttctagtgat 


ttcctgtata 


420 


atttaacagt 


gcccttttca 


tctccagtgt 


gaatatatgg tctgtgtgtc 


tgatgtttgt 


480 


tgctaggaca 


tataccttct 


caaaagattg 


ttttatatgt agtactaact 


aaggtcccat 


540 


aataaaaaga 


tagtatcttt 


taaaatgaaa 


tgtttttgct atagatttgt 


attttaaaac 


600 


ataagaacgt 


cattttggga 


cctatatctc 


agtggcacag gtttaagaac 


gaaggagaaa 


660 


aaggtagttt 


gaaccttggt 


aaattgtaaa 


cagctaataa tgaagttatt 


cttgacatga 


720 


gaaaatcagt 


aattggacca 


ggcgcggtgg 


ctcttgcctg taatcccagc 


actttgggag 


780 


qccqaaac acr 


gcagatcaca 


aggtcaggag 


ttcgagacca gcctgaccaa 


catggtgaaa 


840 


ccctgtctct 


actaaaaata 


caaaaattag 


ccgggggtgg tgacatgtgc 


ctgtaatccc 


900 


agctactcag 


aacrcrctaaaa 


caggagaatc 


gcttaaaccc aggaggcgga 


ggttgcagtg 


960 


agccgagatt 


gcaccactgc 


actccagcct 


gggtggcaga gtgagactcg 


tctcaaaaaa 


1020 


aagaaagaaa 


attagtaatt 


gtaagtaccc 


ctgataagca aattagtaat 


tgtcaatacc 


1080 


cctgttaagc 


aattcctttt 


tgcagtatat 


ttctgaaatg acagaatgct 


gttttaaaaa 


1140 


caaagaaata 


aaatcctgct 


cctgactcgg 


tcaaaatatt ttttaaagtc 


tattgtttgt 


1200 


tgtgctxgct 


ggtactaaga 


ggctatttaa 


aagtataaaa ctgctttgta 


tccatgaggg 


1260 


tttcattgtg 


tgttagcagc 


agtgagcttc 


tattaaatgt atatgtcatt 


tattttgttt 


1320 


aagtggcttt 


cagcaaacct 


cagtcatatt 


cttatgcagg gtattgcgaa 


acaacttgtg 


1380 


ttctattaat 


cgtgtcttca 


attaaaagac 


cacagacttc tggaaactct 


ttgctgtata 


1440 


aagattattc 


tttgttaaca 


aattagacat 


tctagcaaag t 




1481 
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<210> 13 
<211> 480 
<212> DNA 
<213> human 

<400> 13 

ggacatcccg atggtgcagc cgctattaaa ggttcgtttg ttcaacgatt aaagtcctac 60 

gtgatctgag ttcagaccgg agtaatccag gtcggtttct atctaccttc aaattcctcc 120 

ctgtacgaaa ggacaagaga aataaggcct acttcacaaa gcgccttccc ccgtaaatga 180 

tatcatctca acttagtatt atacccacac ccacccaaga acagggtttg ttaagatggc 240 

agggcccggt aatcgcataa aacttaaaac tttacagtca gaggttcaat tcctcttctt 300 

aacaacatac ccatggccaa cctcctactc ctcattgtac ccattctaat cgcaatggca 360 

ttcctaatgc ttaccgaacg aaaaattcta ggctatatac aactacgcaa aggccccaac 420 

gttgtaggcc cctacgggct actacaaccc ttcgctgacg ccataaaact cttcaccaaa 480 



<210> 14 

<211> 21 

<212> DNA 

<213> human 



<210> 15 

<211> 20 

<212> DNA 

<213> human 

<400> 15 

cagtgcctct ttgctcaggc 20 



<210> 16 

<211> 21 

<212> DNA 

<213> human 



<400> 14 

ggcaggcaaa cgcaacccac g 



21 



<400> 16 

tgcctgcaga gttcacccat g 



21 



<210> 17 
<211> 30 
<212> DNA 



<213> human 



<400> 17 
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atctgctggg atgctgggcc ccacttgcaa 



30 



<210> 18 
<211> 21 
<212> DNA 



<213> human 



<400> 18 

tggagcagtc cctagggagg c 



21 



<210> 19 

<211> 21 , 

<212> DNA 

<213> human 

<400> 19 

gttgccccat gagcctccca c 21 

<210> 20 

<211> 21 

<212> DNA 

<213> human 



<210> 21 

<211> 21 

<212> DNA 

<213> human 

<400> 21 

accccccaag caggcagtac a 21 



<210> 22 

<211> 21 

<212> DNA 

<213> human 



<400> 20 

gtacccctag ggacaggcag g 



21 



<400> 22 

gggcaaggtc aggggaatgg a 



21 



<210> 23 
<211> 21 
<212> DNA 



<213> human 



<400> 23 

cagcccagac caaaccagca c 



21 
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<210> 24 

<211> 21 

<212>, DNA 

<213> human 

<400> 24 

gtccctgagg ctgacacact t 21 



<210> 25 

<211> 22 

<212> DNA 

<213> human 

<400> 25 

agctgggccc tgagatcctg ca 22 



<210> 26 

<211> 21 

<212> DNA 

<213> human 

<400> 26 

acctgggtgg cactaacttc a 2 * 



<210> 27 

<211> 26 

<212> DNA 

<213> human • 

<400> 27 

cggcccctgc gctgctcacc atctga 26 



<210> 28 

<211> 21 

<212> DNA 

<213> human 

<400> 28 

ggactgtcgg agcgacactc a 



<210> 29 

<211> 21 

<212> DNA 

<213> human 

<400> 29 

gcggttgatg acgcctgcca g 



<210> 30 
<211> 20 
<212> DNA 
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<213> human 
<400> 30 

tcacatttgt tccatgttac 20 



<210> 31 

<211> 22 
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